NOVEL HUMAN GENES AND GENE EXPRESSION PRODUCTS



CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional Application No.
60/142,311, filed July 2, 1999, entitled "Novel Human Genes and Gene Expression
Products" and also U.S. Provisional Application No. 60/142,310, filed July 2, 1999,
entitled "Novel Human Genes and Gene Expression Products".

FIELD OF THE INVENTION 

The present invention relates to novel polynucleotides of human origin and 
the encoded gene products. 

BACKGROUND OF THE INVENTION

Identification of novel polynucleotides, particularly those that encode an
expressed gene product, is important in the advancement of drug discovery, diagnostic
technologies, and the understanding of the progression and nature of complex diseases such
as cancer. Identification of genes expressed in different cell types isolated from sources
that differ in disease state or stage, developmental stage, exposure to various environmental
factors, the tissue of origin, the species from which the tissue was isolated, and the like is
key to identifying the genetic factors that are responsible for the phenotypes associated
with these various differences.

This invention provides novel human polynucleotides, the polypeptides
encoded by these polynucleotides, and the genes and proteins corresponding to these novel
polynucleotides.

SUMMARY OF THE INVENTION 

This invention relates to novel human polynucleotides and variants thereof,
their encoded polypeptides and variants thereof, to genes corresponding to these
polynucleotides and to proteins expressed by the genes. The invention also relates to
diagnostics and therapeutics comprising such novel human polynucleotides, their
corresponding genes or gene products, including probes, antisense nucleotides, and
antibodies. The polynucleotides of the invention correspond to a polynucleotide
comprising the sequence information of at least one of SEQ ID NOs:1-3351.

Various aspects and embodiments of the invention will be readily apparent
to the ordinarily skilled artisan upon reading the description provided herein.

DETAILED DESCRIPTION OF THE INVENTION 

The invention relates to polynucleotides comprising the disclosed nucleotide
sequences, to full length cDNA, mRNA, genomic sequences, and genes corresponding to these
sequences and degenerate variants thereof, and to polypeptides encoded by the polynucleotides
of the invention and polypeptide variants.

Polypeptide variants differ from wild type protein in having one or more amino
acid substitutions that enhance, add, or diminish a biological activity of the wild type
protein.

Six of the polynucleotides disclosed herein encode new members of the MKK
kinase family; the coding region is found within the nucleotide region in parentheses: SEQ ID
NO:29 (nucleotides 295-421); SEQ ID NO:31 (298-397); SEQ ID NO:196 (37-322); SEQ ID
NO:3175 (nucleotides 14-164); SEQ ID NO:3190 (229-390); and SEQ ID NO:3281 (15-182).
Twenty-four of the polynucleotides encode new members of the family of transcription
factor proteins having a basic region plus leucine zipper: SEQ ID NO:410 (42-191); SEQ ID
NO:552 (116-288); SEQ ID NO:768 (116-288); SEQ ID NO:822 (108-262); SEQ ID NO:836
(158-353); SEQ ID NO:1288 (73-234); SEQ ID NO:1365 (69-257); SEQ ID NO:1540 (289-471);
SEQ ID NO:1549 (200-391); SEQ ID NO:1556 (163-354); SEQ ID NO:1557 (207-398);
SEQ ID NO:1563 (107-298); SEQ ID NO:1622 (180-365); SEQ ID NO:1630 (100-291); SEQ
ID NO:1704 (184-372); SEQ ID NO:1808 (36-161); SEQ ID NO:1454 (49-209); SEQ ID
NO:2363 (48-211); SEQ ID NO:2424 (43-194); SEQ ID NO:3147 (190-369); SEQ ID
NO:3152 (129-320); SEQ ID NO:3158 (167-334); and SEQ ID NO:3208 (34-256).

SEQ ID NOs:186 (175-395); 2591 (60-165); 3307 (43-321); and 3339 (94-342)
encode polypeptides having an SH2 domain, and SEQ ID NOs:234 (23-121), 1832 (18-173),
and 1835 (57-206) encode polypeptides having an SH3 domain. Nine polynucleotides
encode new members of the family of proteins having Ank repeat regions: SEQ ID NO:187
(358-432); SEQ ID NO:1268 (238-315); SEQ ID NO:1804 (301-378); SEQ ID NO:1819 (278-355);
SEQ ID NO:1839 (224-307); SEQ ID NO:1830 (184-267); SEQ ID NO:2562 (18-101);
SEQ ID NO:3015 (131-214); and SEQ ID NO:3267 (97-180).

The following eleven polynucleotides encode polypeptides having a C2H2 type
zinc finger: SEQ ID NOs:308 (110-172); 807 (339-392); 1324 (294-356); 1503 (154-216);
1527 (156-212); 1674 (196-258); 1779 (64-126); 1801 (295-351); 3081 (190-252); 3193
(293-355); and 3306 (161-223). Eight polynucleotides encode polypeptides of the family of
ATPases: SEQ ID NOs:431 (71-428); 639 (157-561); 2135 (2-401); 2684 (9-461); 2859
(100-320); 3178 (45-386); 3197 (281-343) and 3266 (8-139). Polypeptides having a
fibronectin type III domain are encoded by SEQ ID NOs:746 (209-427) and 1192 (186-416).
Polypeptides having an EF-hand domain are encoded by SEQ ID NOs:820 (341-406); 1755
(281-367) and 3285 (16-102). Six polypeptides of the protein kinase family are encoded by
SEQ ID NOs:1157 (41-444); 1478 (54-437); 1496 (241-520); 2286 (12-182); 2969 (5-387);
and 3190 (118-390).

LIM domain-containing polypeptides are encoded by SEQ ID NOs:1269 (79-240);
1309 (248-404); 1360 (222-377); and 1386 (243-398). Two polypeptides of the family
having a C2 domain (protein kinase C-like) are encoded by SEQ ID NOs:1325 (1-234) and
2282 (183-353). Polypeptides having a WD domain, G-beta repeat motif are encoded by SEQ
ID NOs:1336 (66-164); 1380 (42-140); 1711 (263-361); 1762 (236-334); 1909 (160-258);
2218 (127-225); 3047 (191-292); 3108 (275-367) and 3292 (208-300).

SEQ ID NO:1410 (222-350) encodes a member of the trypsin family. SEQ ID
NOs:1417 (8-354); 2281 (20-387) and 2310 (20-371) encode members of the protein tyrosine
phosphatase family. SEQ ID NOs:1464 (4-180) and 1514 (2-252) encode members of the
family having an RNA recognition motif (also known as RRM, RBD, or RNP domain). SEQ
ID NOs:1496 (241-520) and 3297 (7-153) encode helicases having a conserved C-terminal
domain. SEQ ID NO:1538 (9-635) encodes a member of the wnt family of developmental
signaling proteins.

Three polynucleotides encode polypeptides having a homeobox domain: SEQ
ID NOs:1676 (9-86); 1820 (123-299); and 1821 (127-303). A novel thioredoxin is encoded by
SEQ ID NO:1677 (316-369). Two novel members of the ras family are encoded by SEQ ID
NOs:1688 (109-410) and 3258 (138-394). A novel polypeptide having a phosphatidylinositol-specific
phospholipase C Y-domain is encoded by SEQ ID NO:1707 (92-439). A novel serine
carboxypeptidase is encoded by SEQ ID NO:1744 (238-433). A novel polypeptide having
N-terminal homology in the Ets domain is encoded by SEQ ID NO:1811 (184-315). A novel
polypeptide having a bromodomain is encoded by SEQ ID NO:1814 (127-294). A novel
polypeptide having a double-stranded RNA binding motif is encoded by SEQ ID NO:1818
(9-146). A novel polypeptide having a G-protein alpha subunit is encoded by SEQ ID NO:1846
(12-398).

SEQ ID NOs:1911 (35-151) and 1980 (60-197) encode polypeptides having
a C3HC4 type zinc finger domain (RING finger). SEQ ID NO:2065 (253-306) encodes a
polypeptide having a CCHC zinc finger domain. SEQ ID NO:2216 (90-179) encodes a
polypeptide having a WW/rsp5/WWP domain. SEQ ID NO:2428 (25-350) encodes a
polypeptide member of the dual specificity phosphatase family, having a catalytic domain.

SEQ ID NOs:2577 (0-311); 3183 (14-215); and 3195 (0-215) encode
members of the 4 transmembrane segment integral membrane protein family. SEQ ID
NOs:2826 (116-400) and 2871 (198-392) encode polypeptides of the DEAD and DEAH
box helicase family. SEQ ID NO:2944 (18-281) encodes a polypeptide having a calpain
large subunit, domain III.

SEQ ID NO:3274 (11-187) encodes a eukaryotic transcription factor with a
fork head domain. SEQ ID NO:3345 (65-271) encodes a polypeptide having a PDZ
domain, and SEQ ID NO:3351 (124-270) encodes a polypeptide in the family of phorbol
esters/glycerol binding proteins.

Described below are polynucleotide compositions encompassed by the
invention, methods for obtaining cDNA or genomic DNA encoding a full-length gene product,
expression of these polynucleotides and genes, identification of structural motifs of the
polynucleotides and genes, identification of the function of a gene product encoded by a gene
corresponding to a polynucleotide of the invention, use of the provided polynucleotides as
probes and in mapping and in tissue profiling, use of the corresponding polypeptides and other
gene products to raise antibodies, and use of the polynucleotides and their encoded gene
products for therapeutic and diagnostic purposes.

Polynucleotide Compositions

The scope of the invention with respect to polynucleotide compositions
includes, but is not necessarily limited to, polynucleotides having a sequence set forth in
any one of SEQ ID NOs:1-3351; polynucleotides obtained from the biological materials
described herein or other biological sources (particularly human sources) by hybridization
under stringent conditions (particularly conditions of high stringency); genes corresponding
to the provided polynucleotides; variants of the provided polynucleotides and their
corresponding genes, particularly those variants that retain a biological activity of the
encoded gene product (e.g., a biological activity ascribed to a gene product corresponding
to the provided polynucleotides as a result of the assignment of the gene product to a
protein family(ies) and/or identification of a functional domain present in the gene
product). Other nucleic acid compositions contemplated by and within the scope of the
present invention will be readily apparent to one of ordinary skill in the art when provided
with the disclosure here. "Polynucleotide" and "nucleic acid" as used herein with reference
to nucleic acids of the composition are not intended to be limiting as to the length or
structure of the nucleic acid unless specifically indicated.

The invention features polynucleotides that are expressed in human tissue,
specifically human colon, breast, and/or lung tissue. Novel nucleic acid compositions of
the invention comprise a sequence set forth in any one of SEQ ID NOs:1-3351 or an
identifying sequence thereof. An "identifying sequence" is a contiguous sequence of
residues at least about 10 nt to about 20 nt in length, usually at least about 50 nt to about
100 nt in length, that uniquely identifies a polynucleotide sequence, e.g., exhibits less than
90%, usually less than about 80% to about 85% sequence identity to any contiguous
nucleotide sequence of more than about 20 nt. Thus, the subject novel nucleic acid
compositions include full length cDNAs or mRNAs that encompass an identifying
sequence of contiguous nucleotides from any one of SEQ ID NOs:1-3351.

The polynucleotides of the invention also include polynucleotides having
sequence similarity or sequence identity. Nucleic acids having sequence similarity are
detected by hybridization under low stringency conditions, for example, at 50°C and
10X SSC (0.9 M saline/0.09 M sodium citrate) and remain bound when subjected to
washing at 55°C in 1X SSC. Sequence identity can be determined by hybridization under
stringent conditions, for example, at 50°C or higher and 0.1X SSC (9 mM saline/0.9 mM
sodium citrate). Hybridization methods and conditions are well known in the art, see, e.g.,
U.S. Patent No. 5,707,829. Nucleic acids that are substantially identical to the provided
polynucleotide sequences, e.g., allelic variants, genetically altered versions of the gene,
etc., bind to the provided polynucleotide sequences (SEQ ID NOs:1-3351) under stringent
hybridization conditions. By using probes, particularly labeled probes of DNA sequences,
one can isolate homologous or related genes. The source of homologous genes can be any
species, e.g., primate species, particularly human; rodents, such as rats and mice; canines,
felines, bovines, ovines, equines, yeast, nematodes, etc.

Preferably, hybridization is performed using at least 15 contiguous
nucleotides (nt) of at least one of SEQ ID NOs:1-3351. That is, when at least 15 contiguous
nt of one of the disclosed SEQ ID NOs. is used as a probe, the probe will preferentially
hybridize with a nucleic acid comprising the complementary sequence, allowing the
identification and retrieval of the nucleic acids that uniquely hybridize to the selected
probe. Probes from more than one SEQ ID NO. can hybridize with the same nucleic acid if
the cDNA from which they were derived corresponds to one mRNA. Probes of more than
15 nt can be used, e.g., probes of from about 18 nt to about 100 nt, but 15 nt represents
sufficient sequence for unique identification.
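
The probe logic above can be illustrated with a minimal sketch. It assumes perfect base pairing only (real hybridization tolerates mismatches depending on stringency), and the function names and the toy sequences are hypothetical, not taken from the sequence listing.

```python
# Illustrative only: enumerate 15-nt probes from a sequence and locate the
# sites in a target strand to which a probe would pair perfectly.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def probes(seq, k=15):
    """Return every contiguous k-nt window of seq, in order."""
    seq = seq.upper()
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def hybridization_sites(probe, target):
    """Return 0-based start positions in target (read 5' to 3') whose
    sequence is exactly complementary to the probe."""
    site = reverse_complement(probe)
    target = target.upper()
    return [i for i in range(len(target) - len(site) + 1)
            if target.startswith(site, i)]


if __name__ == "__main__":
    probe = "AAACCCGGGTTTACG"                      # hypothetical 15-nt probe
    target = "GG" + reverse_complement(probe) + "CC"
    print(hybridization_sites(probe, target))
```

A longer `k` (for example 18 to 100) can be passed to `probes` to mirror the longer probe lengths mentioned in the text.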

The polynucleotides of the invention also include naturally occurring
variants of the nucleotide sequences (e.g., degenerate variants, allelic variants). Variants of
the polynucleotides of the invention are identified by hybridization of putative variants
with nucleotide sequences disclosed herein, preferably by hybridization under stringent
conditions. For example, by using appropriate wash conditions, variants of the
polynucleotides of the invention can be identified where the allelic variant exhibits at most
about 25-30% base pair (bp) mismatches relative to the selected polynucleotide probe. In
general, allelic variants contain 15-25% bp mismatches, and can contain as little as even
5-15%, or 2-5%, or 1-2% bp mismatches, as well as a single bp mismatch.
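
As a rough illustration of the mismatch percentages above, the following sketch compares a probe with a candidate variant position by position. It assumes an ungapped comparison of two equal-length sequences, which is a simplification; in practice the tolerated mismatch level is set by the wash conditions, not computed. The function names and the 30% default are illustrative choices.

```python
# Illustrative only: percent base mismatches between two equal-length sequences.
def percent_mismatch(probe, candidate):
    """Return the percentage of positions at which the two sequences differ."""
    if len(probe) != len(candidate) or not probe:
        raise ValueError("sequences must be non-empty and of equal length")
    mismatches = sum(1 for p, c in zip(probe.upper(), candidate.upper()) if p != c)
    return 100.0 * mismatches / len(probe)


def within_mismatch_limit(probe, candidate, max_percent=30.0):
    """True if the candidate differs from the probe by at most max_percent."""
    return percent_mismatch(probe, candidate) <= max_percent


if __name__ == "__main__":
    print(percent_mismatch("ACGTACGTAC", "ACGTACGTAA"))
```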

The invention also encompasses homologs corresponding to the
polynucleotides of SEQ ID NOs:1-3351, where the source of homologous genes can be any
mammalian species, e.g., primate species, particularly human; rodents, such as rats;
canines, felines, bovines, ovines, equines, yeast, nematodes, etc. Between mammalian
species, e.g., human and mouse, homologs generally have substantial sequence similarity,
e.g., at least 75% sequence identity, usually at least 90%, more usually at least 95%
between nucleotide sequences. Sequence similarity is calculated based on a reference
sequence, which may be a subset of a larger sequence, such as a conserved motif, coding
region, flanking region, etc. A reference sequence will usually be at least about 18
contiguous nt long, more usually at least about 30 nt long, and may extend to the complete
sequence that is being compared. Algorithms for sequence analysis are known in the art,
such as BLAST, described in Altschul et al., J. Mol. Biol. (1990) 215:403-10.

In general, variants of the invention have a sequence identity greater than at
least about 65%, preferably at least about 75%, more preferably at least about 85%, and can
be greater than at least about 90%, 91%, 92%, 93%, 94%, 95%, or 96%, most preferably
97%, 98% or 99%. For the purposes of this invention, a preferred method of calculating
percent identity is the Smith-Waterman algorithm, applied as follows. Global DNA
sequence identity must be greater than 65% as determined by the Smith-Waterman
homology search algorithm as implemented in MPSRCH program (Oxford Molecular)
using an affine gap search with the following search parameters: gap open penalty, 12; and
gap extension penalty, 1.
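
The following is a minimal sketch of a Smith-Waterman local alignment with affine gaps, using the gap open penalty of 12 and gap extension penalty of 1 stated above. It is not the MPSRCH implementation. The match and mismatch scores (+5 and -4) are assumptions, since the text does not give a scoring matrix, and the convention that the first gapped position costs the open penalty and each further position costs the extension penalty is also an assumption. Percent identity is taken here as identical columns divided by aligned columns in the best-scoring local alignment.

```python
# Illustrative only: Smith-Waterman with affine gaps (Gotoh-style recurrences).
# Each cell stores (score, identical_columns, aligned_columns) so that percent
# identity can be read off the best cell without a separate traceback.
NEG = (float("-inf"), 0, 0)
ZERO = (0, 0, 0)


def _extend(cell, delta, identical=0):
    """Add one alignment column to the path ending in cell."""
    return (cell[0] + delta, cell[1] + identical, cell[2] + 1)


def smith_waterman_identity(a, b, match=5, mismatch=-4, gap_open=12, gap_extend=1):
    """Return (best_local_score, percent_identity) for sequences a and b."""
    a, b = a.upper(), b.upper()
    n, m = len(a), len(b)
    H = [[ZERO] * (m + 1) for _ in range(n + 1)]  # best path ending at (i, j)
    E = [[NEG] * (m + 1) for _ in range(n + 1)]   # path ends with a gap in a
    F = [[NEG] * (m + 1) for _ in range(n + 1)]   # path ends with a gap in b
    best = ZERO
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            E[i][j] = max(_extend(H[i][j - 1], -gap_open),
                          _extend(E[i][j - 1], -gap_extend))
            F[i][j] = max(_extend(H[i - 1][j], -gap_open),
                          _extend(F[i - 1][j], -gap_extend))
            same = a[i - 1] == b[j - 1]
            diag = _extend(H[i - 1][j - 1], match if same else mismatch,
                           1 if same else 0)
            cand = max(diag, E[i][j], F[i][j])
            # Local alignment: restart whenever the running score is not positive.
            H[i][j] = cand if cand[0] > 0 else ZERO
            best = max(best, H[i][j])
    score, identical, columns = best
    return score, (100.0 * identical / columns if columns else 0.0)


if __name__ == "__main__":
    print(smith_waterman_identity("ACGTACGT", "ACGTTCGT"))
```

Under these assumptions a candidate variant would pass the stated threshold when the returned percent identity exceeds 65. Ties between equally scoring alignments are broken in favor of more identical columns, which is an arbitrary choice of this sketch.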

The subject nucleic acids can be cDNAs or genomic DNAs, as well as
fragments thereof, particularly fragments that encode a biologically active gene product
and/or are useful in the methods disclosed herein (e.g., in diagnosis, as a unique identifier
of a differentially expressed gene of interest, etc.). The term "cDNA" as used herein is
intended to include all nucleic acids that share the arrangement of sequence elements found
in native mature mRNA species, where sequence elements are exons and 3' and 5'
non-coding regions. Normally mRNA species have contiguous exons, with the intervening
introns, when present, being removed by nuclear RNA splicing, to create a continuous open
reading frame encoding a polypeptide of the invention.

A genomic sequence of interest comprises the nucleic acid present between
the initiation codon and the stop codon, as defined in the listed sequences, including all of
the introns that are normally present in a native chromosome. It can further include the 3'
and 5' untranslated regions found in the mature mRNA. It can further include specific
transcriptional and translational regulatory sequences, such as promoters, enhancers, etc.,
including about 1 kb, but possibly more, of flanking genomic DNA at either the 5' or 3'
end of the transcribed region. The genomic DNA can be isolated as a fragment of 100 kbp
or smaller, and substantially free of flanking chromosomal sequence. The genomic DNA
flanking the coding region, either 3' or 5', or internal regulatory sequences as sometimes
found in introns, contains sequences required for proper tissue, stage-specific, or
disease-state specific expression.

The nucleic acid compositions of the subject invention can encode all or a
part of the subject polypeptides. Double or single stranded fragments can be obtained from
the DNA sequence by chemically synthesizing oligonucleotides in accordance with
conventional methods, by restriction enzyme digestion, by PCR amplification, etc. Isolated
polynucleotides and polynucleotide fragments of the invention comprise at least about 10,
about 15, about 20, about 35, about 50, about 100, about 150 to about 200, about 250 to
about 300, or about 350 contiguous nt selected from the polynucleotide sequences as
shown in SEQ ID NOs:1-3351. The fragments also include those of lengths intermediate to
the specifically mentioned lengths, such as 35, 36, 37, 38, 39, etc.; 150, 151, 152, 153, 154,
etc. For the most part, fragments will be of at least 15 nt, usually at least 18 nt or 25 nt, and
up to at least about 50 contiguous nt in length or more. In a preferred embodiment, the
polynucleotide molecules comprise a contiguous sequence of at least 12 nt selected from
the group consisting of the polynucleotides shown in SEQ ID NOs:1-3351.

Probes specific to the polynucleotides of the invention can be generated
using the polynucleotide sequences disclosed in SEQ ID NOs:1-3351. The probes are
preferably at least about a 12, 15, 16, 18, 20, 22, 24, or 25 nt fragment of a corresponding
contiguous sequence of SEQ ID NOs:1-3351, and can be less than 2, 1, 0.5, 0.1, or 0.05 kb
in length. The probes can be synthesized chemically or can be generated from longer
polynucleotides using restriction enzymes. The probes can be labeled, for example, with a
radioactive, biotinylated, or fluorescent tag. Preferably, probes are designed based upon an
identifying sequence of a polynucleotide of one of SEQ ID NOs:1-3351. More preferably,
probes are designed based on a contiguous sequence of one of the subject polynucleotides
that remains unmasked following application of a masking program for masking low
complexity (e.g., XBLAST) to the sequence, i.e., one would select an unmasked region, as
indicated by the polynucleotides outside the poly-n stretches of the masked sequence
produced by the masking program.
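
Selecting probe candidates from a masked sequence can be sketched as follows. The sketch assumes the masking program has replaced low-complexity stretches with runs of N (or n), as the reference to "poly-n stretches" suggests; it does not run XBLAST itself. The function name, the minimum length default, and the toy input are hypothetical.

```python
# Illustrative only: list unmasked regions of a masked sequence that are long
# enough to serve as probe candidates.
import re


def unmasked_regions(masked_seq, min_len=15):
    """Return (start, end, subsequence) for each run free of N/n that is at
    least min_len long. Coordinates are 0-based, end-exclusive."""
    return [(m.start(), m.end(), m.group())
            for m in re.finditer(r"[^Nn]+", masked_seq)
            if len(m.group()) >= min_len]


if __name__ == "__main__":
    masked = "ACGTNNNNACGTACGTACGTACGTA"   # toy masked sequence
    for start, end, region in unmasked_regions(masked):
        print(start, end, region)
```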

The polynucleotides of the subject invention are isolated and obtained in
substantial purity, generally as other than an intact chromosome. Usually, the
polynucleotides, either as DNA or RNA, will be obtained substantially free of other
naturally-occurring nucleic acid sequences, generally being at least about 50%, usually at
least about 90% pure and are typically "recombinant", e.g., flanked by one or more
nucleotides with which it is not normally associated on a naturally occurring chromosome.

The polynucleotides of the invention can be provided as a linear molecule or
within a circular molecule, and can be provided within autonomously replicating molecules
(vectors) or within molecules without replication sequences. Expression of the
polynucleotides can be regulated by their own or by other regulatory sequences known in
the art. The polynucleotides of the invention can be introduced into suitable host cells
using a variety of techniques available in the art, such as transferrin polycation-mediated
DNA transfer, transfection with naked or encapsulated nucleic acids, liposome-mediated
DNA transfer, intracellular transportation of DNA-coated latex beads, protoplast fusion,
viral infection, electroporation, gene gun, calcium phosphate-mediated transfection, and the
like.

The subject nucleic acid compositions can be used to, for example, produce
polypeptides, as probes for the detection of mRNA of the invention in biological samples
(e.g., extracts of human cells), to generate additional copies of the polynucleotides, to
generate ribozymes or antisense oligonucleotides, and as single stranded DNA probes or as
triple-strand forming oligonucleotides. The probes described herein can be used to, for
example, determine the presence or absence of the polynucleotide sequences as shown in
SEQ ID NOs:1-3351 or variants thereof in a sample. These and other uses are described in
more detail below.

Use of Polynucleotides to Obtain Full-Length cDNA, Gene, and Promoter Region 

Full-length cDNA molecules comprising the disclosed polynucleotides are
obtained as follows. A polynucleotide having a sequence of one of SEQ ID NOs:1-3351,
or a portion thereof comprising at least 12, 15, 18, or 20 nt, is used as a hybridization probe
to detect hybridizing members of a cDNA library using probe design methods, cloning
methods, and clone selection techniques such as those described in U.S. Patent
No. 5,654,173. Libraries of cDNA are made from selected tissues, such as normal or tumor
tissue, or from tissues of a mammal treated with, for example, a pharmaceutical agent.
Preferably, the tissue is the same as the tissue from which the polynucleotides of the
invention were isolated, as both the polynucleotides described herein and the cDNA
represent expressed genes. Most preferably, the cDNA library is made from the biological
material described herein in the Examples. The choice of cell type for library construction
can be made after the identity of the protein encoded by the gene corresponding to the
polynucleotide of the invention is known. This will indicate which tissue and cell types are
likely to express the related gene, and thus represent a suitable source for the mRNA for
generating the cDNA. As described in the Examples, cDNA of the invention was isolated
from specific cell or tissue types, and such cells and tissues are preferable for obtaining
related nucleic acids.

Techniques for producing and probing nucleic acid sequence libraries are
described, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd
Ed., (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY. The cDNA can be
prepared by using primers based on sequence from SEQ ID NOs:1-3351. In one
embodiment, the cDNA library can be made from only poly-adenylated mRNA. Thus,
poly-T primers can be used to prepare cDNA from the mRNA.

Members of the library that are larger than the provided polynucleotides,
and preferably that encompass the complete coding sequence of the native message, are
obtained. In order to confirm that the entire cDNA has been obtained, RNA protection
experiments are performed as follows. Hybridization of a full-length cDNA to an mRNA
will protect the RNA from RNase degradation. If the cDNA is not full length, then the
portions of the mRNA that are not hybridized will be subject to RNase degradation. This is
assayed, as is known in the art, by changes in electrophoretic mobility on polyacrylamide
gels, or by detection of released monoribonucleotides. Sambrook et al., Molecular
Cloning: A Laboratory Manual, 2nd Ed., (1989) Cold Spring Harbor Press, Cold Spring
Harbor, NY. In order to obtain additional sequences 5' to the end of a partial cDNA, 5'
RACE (PCR Protocols: A Guide to Methods and Applications, (1990) Academic Press,
Inc.) can be performed.

Genomic DNA is isolated using the provided polynucleotides in a manner
similar to the isolation of full-length cDNAs. Briefly, the provided polynucleotides, or
portions thereof, are used as probes to libraries of genomic DNA. Preferably, the library is
obtained from the cell type that was used to generate the polynucleotides of the invention,
but this is not essential. Most preferably, the genomic DNA is obtained from the biological
material described herein in the Examples. Such libraries can be in vectors suitable for
carrying large segments of a genome, such as P1 or YAC, as described in detail in
Sambrook et al., 9.4-9.30. In addition, genomic sequences can be isolated from human
BAC libraries, which are commercially available from Research Genetics, Inc., Huntsville,
Alabama, USA, for example. In order to obtain additional 5' or 3' sequences, chromosome
walking is performed, as described in Sambrook et al., such that adjacent and overlapping
fragments of genomic DNA are isolated. These are mapped and pieced together, as is
known in the art, using restriction digestion enzymes and DNA ligase.

Using the polynucleotide sequences of the invention, corresponding
full-length genes can be isolated using both classical and PCR methods to construct and probe
cDNA libraries. Using either method, Northern blots, preferably, are performed on a
number of cell types to determine which cell lines express the gene of interest at the highest
level. Classical methods of constructing cDNA libraries are taught in Sambrook et al.,
supra. With these methods, cDNA can be produced from mRNA and inserted into viral or
expression vectors. Typically, libraries of mRNA comprising poly(A) tails can be
produced with poly(T) primers. Similarly, cDNA libraries can be produced using the
instant sequences as primers.

PCR methods are used to amplify the members of a cDNA library that
comprise the desired insert. In this case, the desired insert will contain sequence from the
full length cDNA that corresponds to the instant polynucleotides. Such PCR methods
include gene trapping and RACE methods as described in Gruber et al., WO 95/04745 and
Gruber et al., U.S. Patent No. 5,500,356. Kits are commercially available to perform gene
trapping experiments from, for example, Life Technologies, Gaithersburg, Maryland, USA.
In preferred embodiments of RACE, a common primer is designed to anneal to an arbitrary
adaptor sequence ligated to cDNA ends (Apte and Siebert, Biotechniques (1993) 15:890-893;
Edwards et al., Nuc. Acids Res. (1991) 19:5227-5232). When a single gene-specific
RACE primer is paired with the common primer, preferential amplification of sequences
between the single gene-specific primer and the common primer occurs. Commercial
cDNA pools modified for use in RACE are available.

The promoter region of a gene generally is located 5' to the initiation site for
RNA polymerase II. Hundreds of promoter regions contain the "TATA" box, a sequence
such as TATTA or TATAA, which is sensitive to mutations. The promoter region can be
obtained by performing 5' RACE using a primer from the coding region of the gene.
Alternatively, the cDNA can be used as a probe for the genomic sequence, and the region
5' to the coding region is identified by "walking up." If the gene is highly expressed or
differentially expressed, the promoter from the gene can be of use in a regulatory construct
for a heterologous gene.
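
Once a candidate 5' region has been obtained, the TATA-like motifs named above (TATAA or TATTA) can be located with a simple pattern search. This is a minimal sketch: it looks only for those two exact pentamers, says nothing about whether a hit is a functional promoter element, and the function name and toy sequence are hypothetical.

```python
# Illustrative only: report positions of TATAA / TATTA in an upstream sequence.
import re

# Lookahead so that overlapping occurrences are also reported.
TATA_LIKE = re.compile(r"(?=(TAT[AT]A))")


def find_tata_like(upstream):
    """Return (0-based position, motif) for each TATAA or TATTA occurrence."""
    return [(m.start(), m.group(1)) for m in TATA_LIKE.finditer(upstream.upper())]


if __name__ == "__main__":
    print(find_tata_like("GGCTATAAAGGCTATTAGG"))
```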

Once the full-length cDNA or gene is obtained, DNA encoding variants can 
be prepared by site-directed mutagenesis, described in detail in Sambrook et al., 15.3- 
15.63. The choice of codon or nucleotide to be replaced can be based on disclosure herein 
on optional changes in amino acids to achieve altered protein structure and/or function. 

As an alternative method to obtaining DNA or RNA from a biological
material, nucleic acid comprising nucleotides having the sequence of one or more
polynucleotides of the invention can be synthesized. Thus, the invention encompasses
nucleic acid molecules ranging in length from 15 nt (corresponding to at least 15
contiguous nt of one of SEQ ID NOs:1-3351) up to a maximum length suitable for one or
more biological manipulations, including replication and expression, of the nucleic acid
molecule. The invention includes but is not limited to (a) nucleic acid having the size of a
full gene, and comprising at least one of SEQ ID NOs:1-3351; (b) the nucleic acid of (a)
also comprising at least one additional polynucleotide or gene, operably linked to permit
expression of a fusion protein; (c) an expression vector comprising (a) or (b); (d) a plasmid
comprising (a) or (b); and (e) a recombinant viral particle comprising (a) or (b). Once
provided with the polynucleotides disclosed herein, construction or preparation of (a)-(e)
is well within the skill in the art.

The sequence of a nucleic acid comprising at least 15 contiguous nt of at
least any one of SEQ ID NOs:1-3351, preferably the entire sequence of at least any one of
SEQ ID NOs:1-3351, is not limited and can be any sequence of A, T, G, and/or C (for
DNA) and A, U, G, and/or C (for RNA) or modified bases thereof, including inosine and
pseudouridine. The choice of sequence will depend on the desired function and can be
dictated by coding regions desired, the intron-like regions desired, and the regulatory
regions desired. Where the entire sequence of any one of SEQ ID NOs:1-3351 is within
the nucleic acid, the nucleic acid obtained is referred to herein as a polynucleotide
comprising the sequence of any one of SEQ ID NOs:1-3351.






Expression of Polypeptide Encoded by Full-Length cDNA or Full-Length Gene 

The provided polynucleotides (e.g., a polynucleotide having a sequence of
one of SEQ ID NOs:1-3351), the corresponding cDNA, or the full-length gene is used to
express a partial or complete gene product. Constructs of polynucleotides having
sequences of SEQ ID NOs:1-3351 can be generated synthetically. Alternatively,
single-step assembly of a gene and entire plasmid from large numbers of
oligodeoxyribonucleotides is described by, e.g., Stemmer et al., Gene (Amsterdam) (1995)
164(1):49-53. In this method, assembly PCR (the synthesis of long DNA sequences from
large numbers of oligodeoxyribonucleotides (oligos)) is described. The method is derived
from DNA shuffling (Stemmer, Nature (1994) 370:389-391), and does not rely on DNA
ligase, but instead relies on DNA polymerase to build increasingly longer DNA fragments
during the assembly process.
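
The oligo design step that precedes assembly PCR can be sketched as a tiling problem: the target sequence is covered by overlapping oligos on alternating strands, so that each oligo anneals to its neighbors and the polymerase can extend them. The sketch below is not the protocol of the cited reference. The oligo length (40 nt) and overlap (20 nt) are illustrative values, the function names are hypothetical, and `reassemble` only checks the tiling in silico; it does not model the PCR.

```python
# Illustrative only: tile a target sequence with overlapping oligos on
# alternating strands, then confirm the tiling reconstructs the target.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_assembly_oligos(seq, oligo_len=40, overlap=20):
    """Return oligos covering seq; even-numbered oligos are top strand,
    odd-numbered oligos are bottom strand (reverse complement)."""
    if not 0 < overlap < oligo_len:
        raise ValueError("overlap must be between 0 and oligo_len")
    seq = seq.upper()
    step = oligo_len - overlap
    oligos = []
    for n, start in enumerate(range(0, len(seq), step)):
        piece = seq[start:start + oligo_len]
        oligos.append(piece if n % 2 == 0 else reverse_complement(piece))
        if start + oligo_len >= len(seq):   # this oligo reaches the 3' end
            break
    return oligos


def reassemble(oligos, overlap=20):
    """Rebuild the top strand from the tiled oligos (in silico check only)."""
    top = [o if n % 2 == 0 else reverse_complement(o) for n, o in enumerate(oligos)]
    seq = top[0]
    for piece in top[1:]:
        seq += piece[overlap:]              # skip the part shared with the previous oligo
    return seq


if __name__ == "__main__":
    target = ("ATGGCGTACCTGAAGCTTGGATCCGAATTCAGCTGACGTCA" * 3)[:110]  # toy sequence
    oligos = design_assembly_oligos(target)
    print(len(oligos), reassemble(oligos) == target)
```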

Appropriate polynucleotide constructs are purified using standard
recombinant DNA techniques as described in, for example, Sambrook et al., Molecular
Cloning: A Laboratory Manual, 2nd Ed., (1989) Cold Spring Harbor Press, Cold Spring
Harbor, NY, and under current regulations described in United States Dept. of HHS,
National Institute of Health (NIH) Guidelines for Recombinant DNA Research. The gene
product encoded by a polynucleotide of the invention is expressed in any expression
system, including, for example, bacterial, yeast, insect, amphibian and mammalian systems.
Vectors, host cells and methods for obtaining expression in same are well known in the art.
Suitable vectors and host cells are described in U.S. Patent No. 5,654,173.

Polynucleotide molecules comprising a polynucleotide sequence provided
herein are generally propagated by placing the molecule in a vector. Viral and non-viral
vectors are used, including plasmids. The choice of plasmid will depend on the type of cell
in which propagation is desired and the purpose of propagation. Certain vectors are useful
for amplifying and making large amounts of the desired DNA sequence. Other vectors are
suitable for expression in cells in culture. Still other vectors are suitable for transfer and
expression in cells in a whole animal or person. The choice of appropriate vector is well
within the skill of the art. Many such vectors are available commercially. Methods for
preparation of vectors comprising a desired sequence are well known in the art.

The polynucleotides set forth in SEQ ID NOs: 1-3351 or their corresponding
full-length polynucleotides are linked to regulatory sequences as appropriate to obtain the
desired expression properties. These can include promoters (attached either at the 5' end of
the sense strand or at the 3' end of the antisense strand), enhancers, terminators, operators,
repressors, and inducers. The promoters can be regulated or constitutive. In some
situations it may be desirable to use conditionally active promoters, such as tissue-specific
or developmental stage-specific promoters. These are linked to the desired nucleotide
sequence using the techniques described above for linkage to vectors. Any techniques
known in the art can be used.

When any appropriate host cells or organisms are used to replicate and/or
express the polynucleotides or nucleic acids of the invention, the resulting replicated
nucleic acid, RNA, expressed protein or polypeptide, is within the scope of the invention as
a product of the host cell or organism. The product is recovered by any appropriate means
known in the art.

Once the gene corresponding to a selected polynucleotide is identified, its
expression can be regulated in the cell to which the gene is native. For example, an
endogenous gene of a cell can be regulated by an exogenous regulatory sequence as
disclosed in U.S. Patent No. 5,641,670.



Identification of Functional and Structural Motifs of Novel Genes 

Translations of the nucleotide sequence of the provided polynucleotides,
cDNAs or full genes can be aligned with individual known sequences. Similarity with
individual sequences can be used to determine the activity of the polypeptides encoded by
the polynucleotides of the invention. Also, sequences exhibiting similarity with more than
one individual sequence can exhibit activities that are characteristic of either or both
individual sequences.

The full length sequences and fragments of the polynucleotide sequences of
the nearest neighbors can be used as probes and primers to identify and isolate the full
length sequence corresponding to provided polynucleotides. The nearest neighbors can
indicate a tissue or cell type to be used to construct a library for the full-length sequences
corresponding to the provided polynucleotides.

Typically, a selected polynucleotide is translated in all six frames to
determine the best alignment with the individual sequences. The sequences disclosed
herein in the Sequence Listing are in a 5' to 3' orientation and translation in three frames
can be sufficient. These amino acid sequences are referred to, generally, as query
sequences, which will be aligned with the individual sequences. Databases with individual
sequences are described in "Computer Methods for Macromolecular Sequence Analysis"
Methods in Enzymology (1996) 266, Doolittle, Academic Press, Inc., a division of Harcourt
Brace & Co., San Diego, California, USA. Databases include GenBank, EMBL, and DNA
Database of Japan (DDBJ).
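
As a non-limiting illustration, translation of a polynucleotide in all six frames to produce
query sequences can be sketched in Python as follows. The sketch assumes the standard
genetic code and an unambiguous A/C/G/T input; the function names are hypothetical and
any codon containing another symbol is rendered as "X".

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def six_frame_translations(dna: str) -> dict:
    """Map frame labels (+1..+3, -1..-3) to translated query sequences."""
    rc = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(dna[offset:])
        frames[f"-{offset + 1}"] = translate(rc[offset:])
    return frames
```

Where the input is known to be in a 5' to 3' sense orientation, only the "+1", "+2", and "+3"
entries need be used.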

Query and individual sequences can be aligned using the methods and
computer programs described above, and include BLAST, available over the world wide
web at http://www.ncbi.nlm.nih.gov/BLAST. Another alignment algorithm is Fasta,
available in the Genetics Computer Group (GCG) package, Madison, Wisconsin, USA, a
wholly owned subsidiary of Oxford Molecular Group, Inc. Other techniques for alignment
are described in Doolittle, supra. Preferably, an alignment program that permits gaps in the
sequence is utilized to align the sequences. The Smith-Waterman algorithm is one type of
algorithm that permits gaps in sequence alignments. See Meth. Mol. Biol. (1997) 70:173-187.
Also, the GAP program using the Needleman and Wunsch alignment method can be utilized to
align sequences. An alternative search strategy uses MPSRCH software, which runs on a
MASPAR computer. MPSRCH uses a Smith-Waterman algorithm to score sequences on a
massively parallel computer. This approach improves the ability to identify sequences that are
distantly related matches, and is especially tolerant of small gaps and nucleotide sequence
errors. Amino acid sequences encoded by the provided polynucleotides can be used to
search both protein and DNA databases.
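
For illustration, a gapped local alignment score of the Smith-Waterman type can be computed
as in the following Python sketch. The scoring values (match +2, mismatch -1, linear gap
penalty -2) and the function name are assumptions made for this sketch; the programs named
above use substitution matrices and other gap models.

```python
def smith_waterman_score(a: str, b: str, match: int = 2,
                         mismatch: int = -1, gap: int = -2) -> int:
    """Return the best local alignment score between a and b.

    Uses a simple match/mismatch score and a linear gap penalty
    (illustrative values only), keeping two rows of the DP matrix.
    """
    best = 0
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A cell never drops below zero, which is what makes the alignment local.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best
```

Because a gap costs less than abandoning the alignment, two sequences that differ by a single
inserted residue still score close to a perfect match.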

High Similarity. In general, in alignment results considered to be of high
similarity, the percent of the alignment region length is typically at least about 55% of the
total length of the query sequence; more typically, at least about 58%; even more typically, at
least about 60% of the total residue length of the query sequence. Usually, percent length of the
alignment region can be as much as about 62%; more usually, as much as about 64%; even
more usually, as much as about 66%. Further, for high similarity, the region of alignment,
typically, exhibits at least about 75% of sequence identity; more typically, at least about
78%; even more typically, at least about 80% sequence identity. Usually, percent sequence
identity can be as much as about 82%; more usually, as much as about 84%; even more
usually, as much as about 86%.
The p value is used in conjunction with these methods. If high similarity is
found, the query sequence is considered to have high similarity with a profile sequence
when the p value is less than or equal to about 10⁻²; more usually, less than or equal to
about 10⁻³; even more usually, less than or equal to about 10⁻⁴. More typically, the p value
is no more than about 10⁻⁵; more typically, no more than about 10⁻¹⁰; even more
typically, no more than about 10⁻¹⁵ for the query sequence to be considered of high
similarity.
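
The criteria above can be combined, for illustration, as in the following Python sketch. It
applies the lowest thresholds recited above (alignment region of at least about 55% of the
query length, at least about 75% identity, p value of at most about 10⁻²). The function names,
the representation of an alignment as two gapped strings, and the choice to compute identity
over alignment columns are assumptions of this sketch.

```python
def alignment_stats(aligned_query: str, aligned_subject: str, query_length: int):
    """Return (coverage, identity) for a pairwise alignment.

    Both strings must have equal length and use '-' for gaps.
    coverage = aligned query residues / total query length
    identity = identical columns / alignment columns
    """
    if len(aligned_query) != len(aligned_subject):
        raise ValueError("aligned strings must have equal length")
    if query_length <= 0 or not aligned_query:
        raise ValueError("empty query or alignment")
    identical = sum(1 for q, s in zip(aligned_query, aligned_subject)
                    if q == s and q != "-")
    query_residues = sum(1 for q in aligned_query if q != "-")
    return query_residues / query_length, identical / len(aligned_query)


def is_high_similarity(coverage: float, identity: float, p_value: float,
                       min_coverage: float = 0.55, min_identity: float = 0.75,
                       max_p: float = 1e-2) -> bool:
    """Apply the three thresholds together; defaults are the least stringent values."""
    return coverage >= min_coverage and identity >= min_identity and p_value <= max_p
```

Stricter settings, such as min_identity=0.80 or max_p=1e-5, correspond to the more
typical values recited above.
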

Similarity Determined by Sequence Identity Alone. Sequence identity alone
can be used to determine similarity of a query sequence to an individual sequence and can
indicate the activity of the sequence. Such an alignment, preferably, permits gaps to align
sequences. Typically, the query sequence is related to the profile sequence if the sequence
identity over the entire query sequence is at least about 15%; more typically, at least about
20%; even more typically, at least about 25%; even more typically, at least about 50%.
Sequence identity alone as a measure of similarity is most useful when the query sequence
is usually at least 80 residues in length; more usually, 90 residues; even more usually, at
least 95 amino acid residues in length. More typically, similarity can be concluded based
on sequence identity alone when the query sequence is preferably 100 residues in length;
more preferably, 120 residues in length; even more preferably, 150 amino acid residues in
length.

Alignments with Profile and Multiple Aligned Sequences. Translations of
the provided polynucleotides can be aligned with amino acid profiles that define either
protein families or common motifs. Also, translations of the provided polynucleotides can
be aligned to multiple sequence alignments (MSA) comprising the polypeptide sequences
of members of protein families or motifs. Similarity or identity with profile sequences or
MSAs can be used to determine the activity of the gene products (e.g., polypeptides)
encoded by the provided polynucleotides or corresponding cDNA or genes. For example,
sequences that show an identity or similarity with a chemokine profile or MSA can exhibit
chemokine activities.

Profiles can be designed manually by (1) creating an MSA, which is an
alignment of the amino acid sequence of members that belong to the family and (2)
constructing a statistical representation of the alignment. Such methods are described, for
example, in Birney et al., Nucl. Acid Res. (1996) 24(14): 2730-2739. MSAs of some
protein families and motifs are publicly available. MSAs are described also in Sonnhammer
et al., Proteins (1997) 28: 405-420. A brief description of MSAs is reported in Pascarella et
al., Prot. Eng. (1996) 9(3):249-251. Techniques for building profiles from MSAs are
described in Sonnhammer et al., supra, Birney et al., supra; and "Computer Methods for
Macromolecular Sequence Analysis," Methods in Enzymology (1996) 266, Doolittle,
Academic Press, Inc., San Diego, California, USA.
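
One simple statistical representation of an MSA is a table of per-column residue
frequencies. The following Python sketch illustrates that idea only; it is not the method of
the cited references, which use more elaborate models, and the function name and the
treatment of "-" as a gap are assumptions.

```python
from collections import Counter


def build_profile(msa: list) -> list:
    """Return one {residue: frequency} dict per alignment column.

    msa is a list of equal-length aligned sequences; '-' denotes a gap
    and is excluded from the frequency calculation.
    """
    if not msa:
        raise ValueError("MSA must contain at least one sequence")
    length = len(msa[0])
    if any(len(row) != length for row in msa):
        raise ValueError("all aligned sequences must have the same length")
    profile = []
    for column in zip(*msa):
        residues = [r for r in column if r != "-"]
        counts = Counter(residues)
        total = len(residues)
        profile.append({aa: n / total for aa, n in counts.items()} if total else {})
    return profile
```

A column in which a single residue has a frequency near 1.0 is a candidate conserved position.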

Similarity between a query sequence and a protein family or motif can be
determined by (a) comparing the query sequence against the profile and/or (b) aligning the
query sequence with the members of the family or motif. Typically, a program such as
Searchwise is used to compare the query sequence to the statistical representation of the
multiple alignment, also known as a profile (see Birney et al., supra). Other techniques to
compare the sequence and profile are described in Sonnhammer et al., supra and Doolittle,
supra.

Next, methods described by Feng et al., J. Mol. Evol. (1987) 25:351 and
Higgins et al., CABIOS (1989) 5:151 can be used to align the query sequence with the
members of a family or motif, also known as an MSA. Sequence alignments can be
generated using any of a variety of software tools. Examples include PileUp, which creates
a multiple sequence alignment, and is described in Feng et al., J. Mol. Evol. (1987)
25:351. Another method, GAP, uses the alignment method of Needleman et al., J. Mol.
Biol. (1970) 48:443. GAP is best suited for global alignment of sequences. A third
method, BestFit, functions by inserting gaps to maximize the number of matches using the
local homology algorithm of Smith et al., Adv. Appl. Math. (1981) 2:482. In general, the
following factors are used to determine if a similarity between a query sequence and a
profile or MSA exists: (1) number of conserved residues found in the query sequence, (2)
percentage of conserved residues found in the query sequence, (3) number of frameshifts,
and (4) spacing between conserved residues.

Some alignment programs that both translate and align sequences can make
any number of frameshifts when translating the nucleotide sequence to produce the best
alignment. The fewer frameshifts needed to produce an alignment, the stronger the
similarity or identity between the query and profile or MSAs. For example, a weak
similarity resulting from no frameshifts can be a better indication of activity or structure of
a query sequence, than a strong similarity resulting from two frameshifts. Preferably, three
or fewer frameshifts are found in an alignment; more preferably two or fewer frameshifts;
even more preferably, one or fewer frameshifts; even more preferably, no frameshifts are
found in an alignment of query and profile or MSAs.

Conserved residues are those amino acids found at a particular position in
all or some of the family or motif members. Alternatively, a position is considered
conserved if only a certain class of amino acids is found in a particular position in all or
some of the family members. For example, the N-terminal position can contain a
positively charged amino acid, such as lysine, arginine, or histidine.

Typically, a residue of a polypeptide is conserved when a class of amino
acids or a single amino acid is found at a particular position in at least about 40% of all
class members; more typically, at least about 50%; even more typically, at least about 60%
of the members. Usually, a residue is conserved when a class or single amino acid is found
in at least about 70% of the members of a family or motif; more usually, at least about
80%; even more usually, at least about 90%; even more usually, at least about 95%.

A residue is considered conserved when three unrelated amino acids are
found at a particular position in some or all of the members; more usually, two
unrelated amino acids. These residues are conserved when the unrelated amino acids are
found at particular positions in at least about 40% of all class members; more typically, at
least about 50%; even more typically, at least about 60% of the members. Usually, a
residue is conserved when a class or single amino acid is found in at least about 70% of the
members of a family or motif; more usually, at least about 80%; even more usually, at least
about 90%; even more usually, at least about 95%.

A query sequence has similarity to a profile or MSA when the query
sequence comprises at least about 25% of the conserved residues of the profile or MSA;
more usually, at least about 30%; even more usually, at least about 40%. Typically, the
query sequence has a stronger similarity to a profile sequence or MSA when the query
sequence comprises at least about 45% of the conserved residues of the profile or MSA;
more typically, at least about 50%; even more typically, at least about 55%.
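
The conserved-residue criteria above can be illustrated with the following Python sketch. It
marks a column of an MSA as conserved when a single amino acid, or a class of amino acids,
occurs in at least a chosen fraction of members, and then reports the fraction of conserved
positions matched by an aligned query. The residue classes other than the positively charged
class named above, the function names, and the assumption that the query is already aligned
to the MSA columns are all choices made for this sketch.

```python
from collections import Counter

# Illustrative residue classes; only the positively charged class is
# named in the text, the other is an assumption for this sketch.
RESIDUE_CLASSES = {
    "positive": frozenset("KRH"),
    "negative": frozenset("DE"),
}


def conserved_positions(msa: list, min_fraction: float = 0.40) -> dict:
    """Return {column index: set of residues accepted at that column}."""
    conserved = {}
    n_members = len(msa)
    for idx, column in enumerate(zip(*msa)):
        counts = Counter(r for r in column if r != "-")
        if not counts:
            continue
        residue, count = counts.most_common(1)[0]
        if count / n_members >= min_fraction:
            conserved[idx] = {residue}  # a single amino acid is conserved
            continue
        for members in RESIDUE_CLASSES.values():
            if sum(counts[r] for r in members) / n_members >= min_fraction:
                conserved[idx] = set(members)  # a class is conserved
                break
    return conserved


def conserved_fraction(aligned_query: str, conserved: dict) -> float:
    """Fraction of conserved positions at which the query has an accepted residue."""
    if not conserved:
        return 0.0
    hits = sum(1 for idx, allowed in conserved.items()
               if idx < len(aligned_query) and aligned_query[idx] in allowed)
    return hits / len(conserved)
```

Under the thresholds recited above, a returned fraction of at least about 0.25 would indicate
similarity, and a fraction of at least about 0.45 a stronger similarity.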

Identification of Secreted and Membrane-Bound Polypeptides 

Both secreted and membrane-bound polypeptides of the present invention
are of particular interest. For example, levels of secreted polypeptides can be assayed in
body fluids that are convenient, such as blood, plasma, serum, and other body fluids such
as urine, prostatic fluid and semen. Membrane-bound polypeptides are useful for
constructing vaccine antigens or inducing an immune response. Such antigens would
comprise all or part of the extracellular region of the membrane-bound polypeptides.
Because both secreted and membrane-bound polypeptides comprise a fragment of
contiguous hydrophobic amino acids, hydrophobicity predicting algorithms can be used to
identify such polypeptides.

A signal sequence is usually encoded by both secreted and membrane-bound
polypeptide genes to direct a polypeptide to the surface of the cell. The signal sequence
usually comprises a stretch of hydrophobic residues. Such signal sequences can fold into
helical structures. Membrane-bound polypeptides typically comprise at least one
transmembrane region that possesses a stretch of hydrophobic amino acids that can
traverse the membrane. Some transmembrane regions also exhibit a helical structure.
Hydrophobic fragments within a polypeptide can be identified by using computer
algorithms. Such algorithms include Hopp & Woods, Proc. Natl. Acad. Sci. USA (1981)
78:3824-3828; Kyte & Doolittle, J. Mol. Biol. (1982) 157:105-132; and the RAOAR
algorithm, Degli Esposti et al., Eur. J. Biochem. (1990) 190:207-219.
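
For illustration, a sliding-window hydropathy profile using the Kyte & Doolittle scale can
be sketched in Python as follows. The window length of 9 residues and the function name are
assumptions of this sketch; residues outside the 20 standard amino acids are scored as 0.0
here as a simplification.

```python
# Kyte & Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 9) -> list:
    """Return the mean hydropathy of each window along the sequence."""
    protein = protein.upper()
    if window < 1 or window > len(protein):
        return []
    scores = [KYTE_DOOLITTLE.get(aa, 0.0) for aa in protein]
    return [sum(scores[i:i + window]) / window
            for i in range(len(protein) - window + 1)]
```

Windows with a high positive mean are candidate hydrophobic fragments; longer windows are
often chosen when looking for membrane-spanning regions.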

Another method of identifying secreted and membrane-bound polypeptides
is to translate the polynucleotides of the invention in all six frames and determine if at least
8 contiguous hydrophobic amino acids are present. Those translated polypeptides with at
least 8; more typically, 10; even more typically, 12 contiguous hydrophobic amino acids
are considered to be either a putative secreted or membrane bound polypeptide.
Hydrophobic amino acids include alanine, glycine, histidine, isoleucine, leucine, lysine,
methionine, phenylalanine, proline, threonine, tryptophan, tyrosine, and valine.
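
The contiguous-residue test above can be sketched in Python as follows. The residue set is
the list recited above, expressed in one-letter code; the function names are hypothetical.

```python
# One-letter codes for the hydrophobic amino acids recited above:
# Ala, Gly, His, Ile, Leu, Lys, Met, Phe, Pro, Thr, Trp, Tyr, Val.
HYDROPHOBIC = frozenset("AGHILKMFPTWYV")


def longest_hydrophobic_run(protein: str) -> int:
    """Length of the longest stretch of contiguous hydrophobic residues."""
    best = current = 0
    for aa in protein.upper():
        current = current + 1 if aa in HYDROPHOBIC else 0
        best = max(best, current)
    return best


def is_putative_secreted_or_membrane(protein: str, min_run: int = 8) -> bool:
    """Apply the threshold of at least min_run contiguous hydrophobic residues."""
    return longest_hydrophobic_run(protein) >= min_run
```

Setting min_run to 10 or 12 applies the more typical thresholds; the function can be applied to
each of the six translated frames in turn.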



Identification of the Function of an Expression Product of a Full-Length Gene

Ribozymes, antisense constructs, and dominant negative mutants can be
used to determine function of the expression product of a gene corresponding to a
polynucleotide provided herein. The phosphoramidite method of oligonucleotide synthesis
can be used to construct antisense molecules and ribozymes. See Beaucage et al., Tet. Lett.
(1981) 22:1859 and U.S. Patent No. 4,668,777. Automated devices for synthesis are
available to create oligonucleotides using this chemistry. Examples of such devices include
Biosearch 8600, Models 392 and 394 by Applied Biosystems, a division of Perkin-Elmer
Corp., Foster City, California, USA; and Expedite by Perceptive Biosystems, Framingham,
Massachusetts, USA. Synthetic RNA, phosphate analog oligonucleotides, and chemically
derivatized oligonucleotides can also be produced, and can be covalently attached to other
molecules. RNA oligonucleotides can be synthesized, for example, using RNA
phosphoramidites. This method can be performed on an automated synthesizer, such as
Applied Biosystems, Models 392 and 394, Foster City, California, USA.

Oligonucleotides of up to 200 nt can be synthesized, more typically, 100 nt;
more typically, 50 nt; even more typically, 30 to 40 nt. These synthetic fragments can be
annealed and ligated together to construct larger fragments. See, for example, Sambrook et
al., supra. Trans-cleaving catalytic RNAs (ribozymes) are RNA molecules possessing
endoribonuclease activity. Ribozymes are specifically designed for a particular target, and
the target message must contain a specific nucleotide sequence. They are engineered to
cleave any RNA species site-specifically in the background of cellular RNA. The cleavage
event renders the mRNA unstable and prevents protein expression. Importantly, ribozymes
can be used to inhibit expression of a gene of unknown function for the purpose of
determining its function in an in vitro or in vivo context, by detecting the phenotypic effect.

Antisense nucleic acids are designed to specifically bind to RNA, resulting
in the formation of RNA-DNA or RNA-RNA hybrids, with an arrest of DNA replication,
reverse transcription or messenger RNA translation. Antisense polynucleotides based on a
selected polynucleotide sequence can interfere with expression of the corresponding gene.
Antisense polynucleotides are typically generated within the cell by expression from
antisense constructs that contain the antisense strand as the transcribed strand. Antisense
polynucleotides based on the disclosed polynucleotides will bind and/or interfere with the
translation of mRNA comprising a sequence complementary to the antisense
polynucleotide. The expression products of control cells and cells treated with the
antisense construct are compared to detect the protein product of the gene corresponding to
the polynucleotide upon which the antisense construct is based. The protein is isolated and
identified using routine biochemical methods.
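
Because an antisense polynucleotide is complementary to its target, its sequence can be
derived as the reverse complement of the sense sequence. The following Python sketch
illustrates this for DNA and RNA alphabets; the function names are hypothetical and the
inputs are assumed to contain only unambiguous bases.

```python
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_dna(sense_dna: str) -> str:
    """Reverse complement of a sense-strand DNA sequence, written 5' to 3'."""
    return sense_dna.upper().translate(DNA_COMPLEMENT)[::-1]


def antisense_rna(mrna: str) -> str:
    """RNA sequence complementary to an mRNA, written 5' to 3'."""
    return mrna.upper().translate(RNA_COMPLEMENT)[::-1]
```

Applying either function twice returns the original sequence, which provides a quick check
on a designed antisense sequence.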

Given the extensive background literature and clinical experience in
antisense therapy, one skilled in the art can use selected polynucleotides of the invention as
additional potential therapeutics. The choice of polynucleotide can be narrowed by first
testing them for binding to "hot spot" regions of the genome of cancerous cells. If a
polynucleotide is identified as binding to a "hot spot," testing the polynucleotide as an
antisense compound in the corresponding cancer cells is warranted.

Dominant negative mutations also are readily generated for corresponding
proteins that are active as homomultimers. A mutant polypeptide will interact with wild-type
polypeptides (made from the other allele) and form a non-functional multimer. Thus,
a mutation is in a substrate-binding domain, a catalytic domain, or a cellular localization
domain. Preferably, the mutant polypeptide will be overproduced. Point mutations are
made that have such an effect. In addition, fusion of different polypeptides of various
lengths to the terminus of a protein can yield dominant negative mutants. General
strategies are available for making dominant negative mutants (see, e.g., Herskowitz,
Nature (1987) 329:219). Such techniques can be used to create loss of function mutations,
which are useful for determining protein function.

Polypeptides and Variants Thereof 

The polypeptides of the invention include those encoded by the disclosed
polynucleotides, as well as those encoded by nucleic acids that, by virtue of the degeneracy
of the genetic code, are not identical in sequence to the disclosed polynucleotides. Thus, the
invention includes within its scope a polypeptide encoded by a polynucleotide having the
sequence of any one of SEQ ID NOs: 1-3351 or a variant thereof.

In general, the term "polypeptide" as used herein refers to both the full
length polypeptide encoded by the recited polynucleotide, the polypeptide encoded by the
gene represented by the recited polynucleotide, as well as portions or fragments thereof.
"Polypeptides" also includes variants of the naturally occurring proteins, where such
variants are homologous or substantially similar to the naturally occurring protein, and can
be of an origin of the same or different species as the naturally occurring protein
(e.g., human, murine, or some other species that naturally expresses the recited polypeptide,
usually a mammalian species). In general, variant polypeptides have a sequence that has at
least about 80%, usually at least about 90%, and more usually at least about 98% sequence
identity with a differentially expressed polypeptide of the invention, as measured by
BLAST using the parameters described above. The variant polypeptides can be naturally
or non-naturally glycosylated, i.e., the polypeptide has a glycosylation pattern that differs
from the glycosylation pattern found in the corresponding naturally occurring protein.

The invention also encompasses homologs of the disclosed polypeptides (or
fragments thereof) where the homologs are isolated from other species, i.e., other animal or
plant species, where such homologs are usually from mammalian species, e.g., rodents, such
as mice, rats; domestic animals, e.g., horse, cow, dog, cat; and humans. By "homolog" is
meant a polypeptide having at least about 35%, usually at least about 40% and more
usually at least about 60% amino acid sequence identity to a particular differentially
expressed protein as identified above, where sequence identity is determined using the
BLAST algorithm, with the parameters described above.

In general, the polypeptides of the subject invention are provided in a
non-naturally occurring environment, e.g., are separated from their naturally occurring
environment. In certain embodiments, the subject protein is present in a composition that
is enriched for the protein as compared to a control. As such, purified polypeptide is
provided, where by purified is meant that the protein is present in a composition that is
substantially free of non-differentially expressed polypeptides, where by substantially free
is meant that less than 90%, usually less than 60% and more usually less than 50% of the
composition is made up of non-differentially expressed polypeptides.

Also within the scope of the invention are variants; variants of polypeptides
include mutants, fragments, and fusions. Mutants can include amino acid substitutions,
additions or deletions. The amino acid substitutions can be conservative amino acid
substitutions or substitutions to eliminate non-essential amino acids, such as to alter a
glycosylation site, a phosphorylation site or an acetylation site, or to minimize misfolding
by substitution or deletion of one or more cysteine residues that are not necessary for
function. Conservative amino acid substitutions are those that preserve the general charge,
hydrophobicity/hydrophilicity, and/or steric bulk of the amino acid substituted. Variants
can be designed so as to retain biological activity of a particular region of the protein (e.g.,
a functional domain and/or, where the polypeptide is a member of a protein family, a
region associated with a consensus sequence). Selection of amino acid alterations for
production of variants can be based upon the accessibility (interior vs. exterior) of the
amino acid (see, e.g., Go et al., Int. J. Peptide Protein Res. (1980) 15:211), the
thermostability of the variant polypeptide (see, e.g., Querol et al., Prot. Eng. (1996) 9:265),
desired glycosylation sites (see, e.g., Olsen and Thomsen, J. Gen. Microbiol. (1991)
137:519), desired disulfide bridges (see, e.g., Clarke et al., Biochemistry (1993) 32:4322;
and Wakarchuk et al., Protein Eng. (1994) 7:1379), desired metal binding sites (see, e.g.,
Toma et al., Biochemistry (1991) 30:97, and Haezerbrouck et al., Protein Eng. (1993)
6:643), and desired substitutions within proline loops (see, e.g., Masul et al., Appl. Env.
Microbiol. (1994) 60:3579). Cysteine-depleted muteins can be produced as disclosed in
U.S. Patent No. 4,959,314.

Variants also include fragments of the polypeptides disclosed herein,
particularly biologically active fragments and/or fragments corresponding to functional
domains. Fragments of interest will typically be at least about 10 aa to at least about 15 aa
in length, usually at least about 50 aa in length, and can be as long as 300 aa in length or
longer, but will usually not exceed about 1000 aa in length, where the fragment will have a
stretch of amino acids that is identical to a polypeptide encoded by a polynucleotide having
a sequence of any of SEQ ID NOs: 1-3351, or a homolog thereof. The protein variants
described herein are encoded by polynucleotides that are within the scope of the invention.
The genetic code can be used to select the appropriate codons to construct the
corresponding variants.
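
Selecting codons from the genetic code for a desired polypeptide variant can be illustrated
with the following Python sketch. It assumes the standard genetic code and, for simplicity,
picks the first listed codon for each amino acid; in practice a codon would be chosen for the
intended expression host. The names back_translate and CODONS_FOR are hypothetical.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Reverse lookup: amino acid -> all codons that encode it (degeneracy).
CODONS_FOR = {}
for codon, aa in CODON_TABLE.items():
    CODONS_FOR.setdefault(aa, []).append(codon)


def back_translate(protein: str, choose=lambda codons: codons[0]) -> str:
    """Return one DNA sequence encoding the protein.

    choose selects among synonymous codons; the default (first codon)
    is arbitrary. Raises KeyError for non-standard residue symbols.
    """
    return "".join(choose(CODONS_FOR[aa]) for aa in protein.upper())


def translate(dna: str) -> str:
    """Translate complete codons from position 0."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))
```

Translating the back-translated sequence should return the starting polypeptide, which
confirms that the chosen codons encode the intended variant.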

Computer-Related Embodiments 
In general, a library of polynucleotides is a collection of sequence
information, which information is provided in either biochemical form (e.g., as a collection
of polynucleotide molecules), or in electronic form (e.g., as a collection of polynucleotide
sequences stored in a computer-readable form, as in a computer system and/or as part of a
computer program). The sequence information of the polynucleotides can be used in a
variety of ways, e.g., as a resource for gene discovery, as a representation of sequences
expressed in a selected cell type (e.g., cell type markers), and/or as markers of a given
disease or disease state. In general, a disease marker is a representation of a gene product
that is present in all cells affected by disease either at an increased or decreased level
relative to a normal cell (e.g., a cell of the same or similar type that is not substantially
affected by disease). For example, a polynucleotide sequence in a library can be a
polynucleotide that represents an mRNA, polypeptide, or other gene product encoded by
the polynucleotide, that is either overexpressed or underexpressed in a breast ductal cell
affected by cancer relative to a normal (i.e., substantially disease-free) breast cell.

The nucleotide sequence information of the library can be embodied in any
suitable form, e.g., electronic or biochemical forms. For example, a library of sequence
information embodied in electronic form comprises an accessible computer data file (or, in
biochemical form, a collection of nucleic acid molecules) that contains the representative
nucleotide sequences of genes that are differentially expressed (e.g., overexpressed or
underexpressed) as between, for example, i) a cancerous cell and a normal cell; ii) a
cancerous cell and a dysplastic cell; iii) a cancerous cell and a cell affected by a disease or
condition other than cancer; iv) a metastatic cancerous cell and a normal cell and/or
non-metastatic cancerous cell; v) a malignant cancerous cell and a non-malignant cancerous cell
(or a normal cell) and/or vi) a dysplastic cell relative to a normal cell. Other combinations
and comparisons of cells affected by various diseases or stages of disease will be readily
apparent to the ordinarily skilled artisan. Biochemical embodiments of the library include
a collection of nucleic acids that have the sequences of the genes in the library, where the
nucleic acids can correspond to the entire gene in the library or to a fragment thereof, as
described in greater detail below.

The polynucleotide libraries of the subject invention generally comprise
sequence information of a plurality of polynucleotide sequences, where at least one of the
polynucleotides has a sequence of any of SEQ ID NOs: 1-3351. By plurality is meant at
least 2, usually at least 3 and can include up to all of SEQ ID NOs: 1-3351. The length and
number of polynucleotides in the library will vary with the nature of the library, e.g., if the
library is an oligonucleotide array, a cDNA array, a computer database of the sequence
information, etc.

Where the library is an electronic library, the nucleic acid sequence
information can be present in a variety of media. "Media" refers to a manufacture, other
than an isolated nucleic acid molecule, that contains the sequence information of the
present invention. Such a manufacture provides the genome sequence or a subset thereof in
a form that can be examined by means not directly applicable to the sequence as it exists in
a nucleic acid. For example, the nucleotide sequence of the present invention, e.g., the
nucleic acid sequences of any of the polynucleotides of SEQ ID NOs: 1-3351, can be
recorded on computer readable media, e.g., any medium that can be read and accessed
directly by a computer. Such media include, but are not limited to: magnetic storage
media, such as a floppy disc, a hard disc storage medium, and a magnetic tape; optical
storage media such as CD-ROM; electrical storage media such as RAM and ROM; and
hybrids of these categories such as magnetic/optical storage media. One of skill in the art
can readily appreciate how any of the presently known computer readable media can be
used to create a manufacture comprising a recording of the present sequence information.
"Recorded" refers to a process for storing information on computer readable medium, using
any such methods as known in the art. Any convenient data storage structure can be
chosen, based on the means used to access the stored information. A variety of data
processor programs and formats can be used for storage, e.g., word processing text file,
database format, etc. In addition to the sequence information, electronic versions of the
libraries of the invention can be provided in conjunction or connection with other
computer-readable information and/or other types of computer-readable files (e.g.,
searchable files, executable files, etc., including, but not limited to, for example, search
program software, etc.).

By providing the nucleotide sequence in computer readable form, the 
information can be accessed for a variety of purposes. Computer software to access 
sequence information is publicly available. For example, the BLAST (Altschul et al.,
supra) and BLAZE (Brutlag et al., Comp. Chem. (1993) 17:203) search algorithms on a
Sybase system can be used to identify open reading frames (ORFs) within the genome that
contain homology to ORFs from other organisms. 
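
By way of illustration only, the first step of such an analysis, locating candidate ORFs in a stored nucleotide sequence, can be sketched as follows. This is a minimal sketch and not the BLAST or BLAZE algorithm: it scans only the three forward reading frames, and the function name find_orfs and the min_codons parameter are hypothetical names chosen for the example.

```python
def find_orfs(seq, min_codons=30):
    """Return (start, end, frame) tuples for ATG...stop ORFs on the forward strand.

    start is the index of the A of the ATG, end is one past the stop codon,
    and frame is 0, 1 or 2. min_codons counts codons before the stop codon.
    """
    stops = {"TAA", "TAG", "TGA"}
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None  # position of the first ATG seen since the last stop
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in stops:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None
    return orfs
```

Candidate ORFs found in this way could then be submitted to a publicly available homology search program for comparison with ORFs from other organisms.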

As used herein, "a computer-based system" refers to the hardware means, 
software means, and data storage means used to analyze the nucleotide sequence 
information of the present invention. The minimum hardware of the computer-based
systems of the present invention comprises a central processing unit (CPU), input means,
output means, and data storage means. A skilled artisan can readily appreciate that any one
of the currently available computer-based systems is suitable for use in the present
invention. The data storage means can comprise any manufacture comprising a recording
of the present sequence information as described above, or a memory access means that can
access such a manufacture.

"Search means" refers to one or more programs implemented on the
computer-based system, to compare a target sequence or target structural motif, or 
expression levels of a polynucleotide in a sample, with the stored sequence information. 
Search means can be used to identify fragments or regions of the genome that match a
particular target sequence or target motif. A variety of algorithms are publicly
known and commercially available, e.g., MacPattern (EMBL), BLASTN and BLASTX
(NCBI). A "target sequence" can be any polynucleotide or amino acid sequence of six or
more contiguous nucleotides or two or more amino acids, preferably from about 10 to 100
amino acids or from about 30 to 300 nt. A variety of comparing means can be used to
accomplish comparison of sequence information from a sample (e.g., to analyze target
sequences, target motifs, or relative expression levels) with the data storage means. A
skilled artisan can readily recognize that any one of the publicly available homology search
programs can be used as the search means for the computer based systems of the present
invention to accomplish comparison of target sequences and motifs. Computer programs to
analyze expression levels in a sample and in controls are also known in the art.
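
As a non-limiting sketch of a very simple search means, the following compares a target sequence of six or more nucleotides against stored sequence information and reports exact matches on either strand. It is an assumption-laden illustration, not a homology search program such as BLASTN: only exact matches are found, and the names reverse_complement, search_library, and the record identifiers are hypothetical.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(comp[base] for base in reversed(seq.upper()))


def search_library(library, target):
    """Find exact occurrences of target in each stored sequence.

    library maps a record identifier to a nucleotide sequence.
    Returns a list of (identifier, strand, position) tuples, where strand
    is "+" for the target itself and "-" for its reverse complement.
    """
    target = target.upper()
    if len(target) < 6:
        raise ValueError("target sequence must be at least six nucleotides")
    hits = []
    for seq_id, seq in library.items():
        seq = seq.upper()
        for strand, query in (("+", target), ("-", reverse_complement(target))):
            pos = seq.find(query)
            while pos != -1:
                hits.append((seq_id, strand, pos))
                pos = seq.find(query, pos + 1)  # allow overlapping matches
    return hits
```

For example, with a library of two hypothetical records, search_library({"seq_a": "TTGACCTGAAGG", "seq_b": "CCTTCAGGTCAA"}, "GACCTGAA") reports one match on the forward strand of seq_a and one match on the reverse strand of seq_b.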

A "target structural motif," or "target motif," refers to any rationally 
selected sequence or combination of sequences in which the sequence(s) are chosen based 
on a three-dimensional configuration that is formed upon the folding of the target motif, or 
on consensus sequences of regulatory or active sites. There are a variety of target motifs
known in the art. Protein target motifs include, but are not limited to, enzyme active sites
and signal sequences. Nucleic acid target motifs include, but are not limited to, hairpin 
structures, promoter sequences and other expression elements such as binding sites for 
transcription factors. 

A variety of structural formats for the input and output means can be used to
input and output the information in the computer-based systems of the present invention.
One format for an output means ranks the relative expression levels of different 
polynucleotides. Such presentation provides a skilled artisan with a ranking of relative 
expression levels to determine a gene expression profile. 
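
One such output format can be sketched as follows, purely as an illustration. The function name rank_expression and the identifiers used as dictionary keys are hypothetical, and the expression values are assumed to have been measured and normalized elsewhere.

```python
def rank_expression(levels):
    """Rank polynucleotides by relative expression level, highest first.

    levels maps a polynucleotide identifier to its relative expression level.
    Returns a list of (rank, identifier, level) tuples with rank starting at 1.
    """
    ordered = sorted(levels.items(), key=lambda item: item[1], reverse=True)
    return [(rank, seq_id, level)
            for rank, (seq_id, level) in enumerate(ordered, start=1)]
```

The ranked list produced by such an output means constitutes a simple gene expression profile for the sample.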

As discussed above, the "library" of the invention also encompasses
biochemical libraries of the polynucleotides of SEQ ID NOs: 1-3351, e.g., collections of
nucleic acids representing the provided polynucleotides. The biochemical libraries can take
a variety of forms, e.g., a solution of cDNAs, a pattern of probe nucleic acids stably
associated with a surface of a solid support (i.e., an array) and the like. Of particular
interest are nucleic acid arrays in which one or more of SEQ ID NOs: 1-3351 is represented
on the array. By array is meant an article of manufacture that has at least a substrate with at
least two distinct nucleic acid targets on one of its surfaces, where the number of distinct
nucleic acids can be considerably higher, typically being at least 10 nt, usually at least 20 nt 
and often at least 25 nt. A variety of different array formats have been developed and are 
known to those of skill in the art. The arrays of the subject invention find use in a variety of 
applications, including gene expression analysis, drug screening, mutation analysis and the
like, as disclosed in the above-listed exemplary patent documents. 

In addition to the above nucleic acid libraries, analogous libraries of 
polypeptides are also provided, where the polypeptides of the library will
represent at least a portion of the polypeptides encoded by SEQ ID NOs: 1-3351. 

Use of Polynucleotide Probes in Mapping and in Tissue Profiling

Polynucleotide probes, generally comprising at least 12 contiguous nt of a 
polynucleotide as shown in the Sequence Listing, are used for a variety of purposes, such 
as chromosome mapping of the polynucleotide and detection of transcription levels. 
Additional disclosure about preferred regions of the disclosed polynucleotide sequences is
found in the Examples. A probe that hybridizes specifically to a polynucleotide disclosed
herein should provide a detection signal at least 5-, 10-, or 20-fold higher than the 
background hybridization provided with other unrelated sequences. 
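
The specificity criterion above reduces to a ratio of the detection signal to the background signal. A minimal sketch follows, assuming that both signals have been measured in the same arbitrary units; the function names and the default threshold of 5-fold are illustrative choices, not requirements of the method.

```python
def fold_over_background(signal, background):
    """Return the ratio of a probe's detection signal to background signal."""
    if background <= 0:
        raise ValueError("background signal must be positive")
    return signal / background


def is_specific(signal, background, min_fold=5.0):
    """Return True if the signal is at least min_fold times the background."""
    return fold_over_background(signal, background) >= min_fold
```

A probe giving a signal of 1200 units against a background of 100 units would thus satisfy the 5-fold and 10-fold criteria but not the 20-fold criterion.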

Detection of Expression Levels. Nucleotide probes are used to detect
expression of a gene corresponding to the provided polynucleotide. In Northern blots,
mRNA is separated electrophoretically and contacted with a probe. A probe is detected as
hybridizing to an mRNA species of a particular size. The amount of hybridization is
quantitated to determine relative amounts of expression, for example under a particular
condition. Probes are used for in situ hybridization to cells to detect expression. Probes
can also be used in vivo for diagnostic detection of hybridizing sequences. Probes are
typically labeled with a radioactive isotope. Other types of detectable labels can be used
such as chromophores, fluors, and enzymes. Other examples of nucleotide hybridization 
assays are described in WO92/02526 and U.S. Patent No. 5,124,246. 

The Polymerase Chain Reaction (PCR) is another means for
detecting small amounts of target nucleic acids (see, e.g., Mullis et al., Meth. Enzymol.
(1987) 155:335; U.S. Patent No. 4,683,195; and U.S. Patent No. 4,683,202). Two primer
polynucleotides that hybridize with the target nucleic acids are used to prime
the reaction. The primers can be composed of sequence within or 3' and 5' to the
polynucleotides of the Sequence Listing. Alternatively, if the primers are 3' and 5' to these
polynucleotides, they need not hybridize to them or the complements. After amplification
of the target with a thermostable polymerase, the amplified target nucleic acids can be
detected by methods known in the art, e.g., Southern blot. mRNA or cDNA can also be
detected by traditional blotting techniques (e.g., Southern blot, Northern blot, etc.)
described in Sambrook et al., "Molecular Cloning: A Laboratory Manual" (New York,
Cold Spring Harbor Laboratory, 1989) (e.g., without PCR amplification). In general,
mRNA or cDNA generated from mRNA using a polymerase enzyme can be purified and
separated using gel electrophoresis, and transferred to a solid support, such as
nitrocellulose. The solid support is exposed to a labeled probe, washed to remove any
unhybridized probe, and duplexes containing the labeled probe are detected.

Mapping. Polynucleotides of the present invention can be used to identify a
chromosome on which the corresponding gene resides. Such mapping can be useful in
identifying the function of the polynucleotide-related gene by its proximity to other genes
with known function. Function can also be assigned to the polynucleotide-related gene
when particular syndromes or diseases map to the same chromosome. For example, use of
polynucleotide probes in identification and quantification of nucleic acid sequence
aberrations is described in U.S. Patent No. 5,783,387. An exemplary mapping method is
fluorescence in situ hybridization (FISH), which facilitates comparative genomic
hybridization to allow total genome assessment of changes in relative copy number of
DNA sequences (see, e.g., Valdes et al., Methods in Molecular Biology (1997) 68:1).
Polynucleotides can also be mapped to particular chromosomes using, for example,
radiation hybrids or chromosome-specific hybrid panels. See Leach et al., Advances in
Genetics (1995) 33:63-99; Walter et al., Nature Genetics (1994) 7:22; Walter and
Goodfellow, Trends in Genetics (1992) 9:352. Panels for radiation hybrid mapping are
available from Research Genetics, Inc., Huntsville, Alabama, USA. The statistical program
RHMAP can be used to construct a map based on the data from radiation hybridization
with a measure of the relative likelihood of one order versus another. RHMAP is available
via the world wide web at http://www.sph.umich.edu/group/statgen/software. In addition,
commercial programs are available for identifying regions of chromosomes commonly
associated with disease, such as cancer.

Tissue Typing or Profiling. Expression of specific mRNA corresponding to
the provided polynucleotides can vary in different cell types and can be tissue-specific.
This variation of mRNA levels in different cell types can be exploited with nucleic acid
probe assays to determine tissue types. For example, PCR, branched DNA probe assays, or
blotting techniques utilizing nucleic acid probes substantially identical or complementary
to polynucleotides listed in the Sequence Listing can determine the presence or absence of
the corresponding cDNA or mRNA.

Tissue typing can be used to identify the developmental organ or tissue 
source of a metastatic lesion by identifying the expression of a particular marker of that 
organ or tissue. If a polynucleotide is expressed only in a specific tissue type, and a
metastatic lesion is found to express that polynucleotide, then the developmental source of 
the lesion has been identified. Expression of a particular polynucleotide can be assayed by 
detection of either the corresponding mRNA or the protein product. 

Use of Polymorphisms. A polynucleotide of the invention can be used in
forensics, genetic analysis, mapping, and diagnostic applications where the corresponding
region of a gene is polymorphic in the human population. Any means for detecting a
polymorphism in a gene can be used, including, but not limited to, electrophoresis of
protein polymorphic variants, differential sensitivity to restriction enzyme cleavage, and
hybridization to allele-specific probes.

Antibody Production

Expression products of a polynucleotide of the invention, as well as the
corresponding mRNA, cDNA, or complete gene, can be prepared and used for raising
antibodies for experimental, diagnostic, and therapeutic purposes. For polynucleotides to
which a corresponding gene has not been assigned, this provides an additional method of
identifying the corresponding gene. The polynucleotide or related cDNA is expressed as
described above, and antibodies are prepared. These antibodies are specific to an epitope
on the polypeptide encoded by the polynucleotide, and can precipitate or bind to the
corresponding native protein in a cell or tissue preparation or in a cell-free extract of an in
vitro expression system.

Methods for production of monoclonal and polyclonal antibodies that
specifically bind a selected antigen are well known in the art. The antibodies specifically
bind to epitopes present in the polypeptides encoded by polynucleotides disclosed in the
Sequence Listing. Typically, at least 6, 8, 10, or 12 contiguous amino acids are required to
form an epitope. Epitopes that involve non-contiguous amino acids may require a longer
polypeptide, e.g., at least 15, 25, or 50 amino acids. Antibodies that specifically bind to
human polypeptides encoded by the provided polynucleotides should provide a detection
signal at least 5-, 10-, or 20-fold higher than a detection signal provided with other proteins
when used in Western blots or other immunochemical assays. Preferably, antibodies that
specifically bind polypeptides of the invention do not bind to other proteins in immunochemical
assays at detectable levels and can immunoprecipitate the specific polypeptide from
solution.

The invention also contemplates naturally occurring antibodies specific for a
polypeptide of the invention. For example, serum antibodies to a polypeptide of the
invention in a human population can be purified by methods well known in the art, e.g., by
passing antiserum over a column to which the corresponding selected polypeptide or fusion
protein is bound. The bound antibodies can then be eluted from the column, for example
using a buffer with a high salt concentration.

In addition to the antibodies discussed above, the invention also
contemplates genetically engineered antibodies and antibody derivatives (e.g., single chain
antibodies, antibody fragments (e.g., Fab, etc.)), according to methods well known in the
art.

Other embodiments of the present invention include humanized monoclonal
antibodies capable of binding to the polypeptides of the invention. The phrase "humanized
antibody" refers to an antibody derived from a non-human antibody, typically a mouse
monoclonal antibody. Alternatively, a humanized antibody may be derived from a
chimeric antibody that retains or substantially retains the antigen-binding properties of the
parental, non-human, antibody but which exhibits diminished immunogenicity as compared
to the parental antibody when administered to humans. The phrase "chimeric antibody," as
used herein, refers to an antibody containing sequence derived from two different
antibodies (see, e.g., U.S. Patent No. 4,816,567) which typically originate from different
species. Most typically, chimeric antibodies comprise human and murine antibody
fragments, generally human constant and mouse variable regions.

Because humanized antibodies are far less immunogenic in humans than the
parental mouse monoclonal antibodies, they can be used for the treatment of humans with
far less risk of anaphylaxis. Thus, these antibodies may be preferred in therapeutic
applications that involve in vivo administration to a human such as, e.g., use as radiation
sensitizers for the treatment of neoplastic disease or use in methods to reduce the side
effects of, e.g., cancer therapy.

Humanized antibodies may be achieved by a variety of methods including,
for example: (1) grafting the non-human complementarity determining regions (CDRs)
onto a human framework and constant region (a process referred to in the art as
"humanizing"), or, alternatively, (2) transplanting the entire non-human variable domains,
but "cloaking" them with a human-like surface by replacement of surface residues (a
process referred to in the art as "veneering"). In the present invention, humanized
antibodies will include both "humanized" and "veneered" antibodies. These methods are
disclosed in, e.g., Jones et al., Nature 321:522-525 (1986); Morrison et al., Proc. Natl.
Acad. Sci. U.S.A. 81:6851-6855 (1984); Morrison and Oi, Adv. Immunol. 44:65-92
(1988); Verhoeyen et al., Science 239:1534-1536 (1988); Padlan, Molec. Immunol. 28:489-498
(1991); Padlan, Molec. Immunol. 31(3):169-217 (1994); and Kettleborough, C.A. et al.,
Protein Eng. 4(7):773-83 (1991), each of which is incorporated herein by reference.

The phrase "complementarity determining region" refers to amino acid
sequences which together define the binding affinity and specificity of the natural Fv
region of a native immunoglobulin binding site. See, e.g., Chothia et al., J. Mol. Biol.
196:901-917 (1987); Kabat et al., U.S. Dept. of Health and Human Services NIH
Publication No. 91-3242 (1991). The phrase "constant region" refers to the portion of the
antibody molecule that confers effector functions. In the present invention, mouse constant
regions are substituted by human constant regions. The constant regions of the subject
humanized antibodies are derived from human immunoglobulins. The heavy chain
constant region can be selected from any of the five isotypes: alpha, delta, epsilon, gamma
or mu.

One method of humanizing antibodies comprises aligning the non-human 
heavy and light chain sequences to human heavy and light chain sequences, selecting and
replacing the non-human framework with a human framework based on such alignment,
molecular modeling to predict the conformation of the humanized sequence and comparing 
to the conformation of the parent antibody. This process is followed by repeated back 
mutation of residues in the CDR region which disturb the structure of the CDRs until the 
predicted conformation of the humanized sequence model closely approximates the
conformation of the non-human CDRs of the parent non-human antibody. Such humanized
antibodies may be further derivatized to facilitate uptake and clearance, e.g., via Ashwell 
receptors. See, e.g., U.S. Patent Nos. 5,530,101 and 5,585,089 which patents are 
incorporated herein by reference. 

Humanized antibodies can also be produced using transgenic animals that
are engineered to contain human immunoglobulin loci. For example, WO 98/24893
discloses transgenic animals having a human Ig locus wherein the animals do not produce
functional endogenous immunoglobulins due to the inactivation of endogenous heavy and
light chain loci. WO 91/10741 also discloses transgenic non-primate mammalian hosts
capable of mounting an immune response to an immunogen, wherein the antibodies have
primate constant and/or variable regions, and wherein the endogenous immunoglobulin-encoding
loci are substituted or inactivated. WO 96/30498 discloses the use of the Cre/Lox
system to modify the immunoglobulin locus in a mammal, such as to replace all or a
portion of the constant or variable region to form a modified antibody molecule. WO
94/02602 discloses non-human mammalian hosts having inactivated endogenous Ig loci
and functional human Ig loci. U.S. Patent No. 5,939,598 discloses methods of making
transgenic mice in which the mice lack endogenous heavy chains, and express an
exogenous immunoglobulin locus comprising one or more xenogeneic constant regions.

Using a transgenic animal described above, an immune response can be
produced to a selected antigenic molecule, and antibody-producing cells can be removed
from the animal and used to produce hybridomas that secrete human monoclonal
antibodies. Immunization protocols, adjuvants, and the like are known in the art, and are
used in immunization of, for example, a transgenic mouse as described in WO 96/33735.
This publication discloses monoclonal antibodies against a variety of antigenic molecules
including IL-6, IL-8, TNF, human CD4, L-selectin, gp39, and tetanus toxin. The
monoclonal antibodies can be tested for the ability to inhibit or neutralize the biological
activity or physiological effect of the corresponding protein. WO 96/33735 discloses that
monoclonal antibodies against IL-8, derived from immune cells of transgenic mice
immunized with IL-8, blocked IL-8-induced functions of neutrophils. Human monoclonal
antibodies with specificity for the antigen used to immunize transgenic animals are also
disclosed in WO 96/34096.

Polynucleotides or Arrays for Diagnostics 

Polynucleotide arrays are created by spotting polynucleotide probes onto a
substrate (e.g., glass, nitrocellulose, etc.) in a two-dimensional matrix or array having bound
probes. The probes can be bound to the substrate by either covalent bonds or by non-specific
interactions, such as hydrophobic interactions. Samples of polynucleotides can be
detectably labeled (e.g., using radioactive or fluorescent labels) and then hybridized to the
probes. Double stranded polynucleotides, comprising the labeled sample polynucleotides
bound to probe polynucleotides, can be detected once the unbound portion of the sample is
washed away. Techniques for constructing arrays and methods of using these arrays are
described in EP 799 897; WO 97/29212; WO 97/27317; EP 785 280; WO 97/02357; U.S.
Patent No. 5,593,839; U.S. Patent No. 5,578,832; EP 728 520; U.S. Patent No. 5,599,695;
EP 721 016; U.S. Patent No. 5,556,752; WO 95/22058; and U.S. Patent No. 5,631,734.
Arrays can be used to, for example, examine differential expression of genes and can be
used to determine gene function. For example, arrays can be used to detect differential
expression of a polynucleotide between a test cell and control cell (e.g., cancer cells and
normal cells). For example, high expression of a particular message in a cancer cell, which
is not observed in a corresponding normal cell, can indicate a cancer specific gene product.
Exemplary uses of arrays are further described in, for example, Pappalarado et al., Sem.
Radiation Oncol. (1998) 8:217; and Ramsay, Nature Biotechnol. (1998) 16:40.

Differential Expression in Diagnosis 

The polynucleotides of the invention can also be used to detect differences 
in expression levels between two cells, e.g., as a method to identify abnormal or diseased
tissue in a human. For polynucleotides corresponding to profiles of protein families, the
choice of tissue can be selected according to the putative biological function. In general, 
the expression of a gene corresponding to a specific polynucleotide is compared between a 
first tissue that is suspected of being diseased and a second, normal tissue of the human. 
The tissue suspected of being abnormal or diseased can be derived from a different tissue
type of the human, but preferably it is derived from the same tissue type; for example an
intestinal polyp or other abnormal growth should be compared with normal intestinal 
tissue. The normal tissue can be the same tissue as that of the test sample, or any normal 
tissue of the patient, especially those that express the polynucleotide-related gene of 
interest (e.g., brain, thymus, testis, heart, prostate, placenta, spleen, small intestine, skeletal
muscle, pancreas, and the mucosal lining of the colon). A difference between the
polynucleotide-related gene, mRNA, or protein in the two tissues which are compared, for 
example in molecular weight, amino acid or nucleotide sequence, or relative abundance, 
indicates a change in the gene, or a gene which regulates it, in the tissue of the human that 
was suspected of being diseased. Examples of detection of differential expression and its
use in diagnosis of cancer are described in U.S. Patent Nos. 5,688,641 and 5,677,125.

A genetic predisposition to disease in a human can also be detected by
comparing expression levels of an mRNA or protein corresponding to a polynucleotide of
the invention in a fetal tissue with levels associated with normal fetal tissue. Fetal tissues that
are used for this purpose include, but are not limited to, amniotic fluid, chorionic villi,
blood, and the blastomere of an in vitro-fertilized embryo. The comparable normal
polynucleotide-related gene is obtained from any tissue. The mRNA or protein is obtained
from a normal tissue of a human in which the polynucleotide-related gene is expressed.
Differences such as alterations in the nucleotide sequence or size of the same product of the
fetal polynucleotide-related gene or mRNA, or alterations in the molecular weight, amino
acid sequence, or relative abundance of fetal protein, can indicate a germline mutation in
the polynucleotide-related gene of the fetus, which indicates a genetic predisposition to
disease. In general, diagnostic, prognostic, and other methods of the invention based on
differential expression involve detection of a level or amount of a gene product, particularly
a differentially expressed gene product, in a test sample obtained from a patient suspected
of having or being susceptible to a disease (e.g., breast cancer, lung cancer, colon cancer
and/or metastatic forms thereof), and comparing the detected levels to those levels found in
normal cells (e.g., cells substantially unaffected by cancer) and/or other control cells (e.g.,
to differentiate a cancerous cell from a cell affected by dysplasia). Furthermore, the
severity of the disease can be assessed by comparing the detected levels of a differentially
expressed gene product with those levels detected in samples representing the levels of
differentially expressed gene product associated with varying degrees of severity of disease. It should
be noted that use of the term "diagnostic" herein is not necessarily meant to exclude
"prognostic" or "prognosis," but rather is used as a matter of convenience.

The term "differentially expressed gene" is generally intended to encompass 
a polynucleotide that can, for example, include an open reading frame encoding a gene 
product (e.g., a polypeptide), and/or introns of such genes and adjacent 5' and 3' non-coding 
nucleotide sequences involved in the regulation of expression, up to about 20 kb beyond
the coding region, but possibly further in either direction. The gene can be introduced into
an appropriate vector for extrachromosomal maintenance or for integration into a host 
genome. In general, a difference in expression level associated with a decrease in 
expression level of at least about 25%, usually at least about 50% to 75%, more usually at 
least about 90% or more is indicative of a differentially expressed gene of interest, i.e., a
gene that is underexpressed or down-regulated in the test sample relative to a control
sample. Furthermore, a difference in expression level associated with an increase in
expression of at least about 25%, usually at least about 50% to 75%, more usually at least
about 90% and can be at least about 1½-fold, usually at least about 2-fold to about 10-fold,
and can be about 100-fold to about 1,000-fold increase relative to a control sample is
indicative of a differentially expressed gene of interest, i.e., an overexpressed or up-regulated
gene.
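
The thresholds recited above can be applied to a pair of measured expression levels as in the following minimal sketch. The function name classify_expression is hypothetical, the default of 25% corresponds to the least stringent threshold recited, and the test and control levels are assumed to be expressed in the same units.

```python
def classify_expression(test_level, control_level, min_change=0.25):
    """Classify a gene product as under- or overexpressed relative to control.

    min_change is the minimum fractional change (0.25 means 25%) required
    to call a difference; a stricter value such as 0.5 or 0.9 can be passed.
    """
    if control_level <= 0:
        raise ValueError("control level must be positive")
    change = (test_level - control_level) / control_level
    if change <= -min_change:
        return "underexpressed"
    if change >= min_change:
        return "overexpressed"
    return "unchanged"
```

A test level of 50 against a control level of 100 is a 50% decrease and is classified as underexpressed, whereas a test level of 110 against the same control is a 10% increase and falls below the 25% threshold.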

"Differentially expressed polynucleotide" as used herein means a nucleic 
acid molecule (RNA or DNA) comprising a sequence that represents a differentially 
expressed gene, e.g., the differentially expressed polynucleotide comprises a sequence (e.g.,
an open reading frame encoding a gene product) that uniquely identifies a differentially
expressed gene so that detection of the differentially expressed polynucleotide in a sample
is correlated with the presence of a differentially expressed gene in a sample.
"Differentially expressed polynucleotides" is also meant to encompass fragments of the
disclosed polynucleotides, e.g., fragments retaining biological activity, as well as nucleic
acids homologous, substantially similar, or substantially identical (e.g., having about 90%
sequence identity) to the disclosed polynucleotides.

"Diagnosis" as used herein generally includes determination of a subject's 
susceptibility to a disease or disorder, determination as to whether a subject is presently 
affected by a disease or disorder, as well as to the prognosis of a subject affected by a
disease or disorder (e.g., identification of pre-metastatic or metastatic cancerous states,
stages of cancer, or responsiveness of cancer to therapy). The present invention 
particularly encompasses diagnosis of subjects in the context of breast cancer (e.g., 
carcinoma in situ (e.g., ductal carcinoma in situ), estrogen receptor (ER)-positive breast 
cancer, ER-negative breast cancer, or other forms and/or stages of breast cancer), lung
cancer (e.g., small cell carcinoma, non-small cell carcinoma, mesothelioma, and other
forms and/or stages of lung cancer), and colon cancer (e.g., adenomatous polyp, colorectal 
carcinoma, and other forms and/or stages of colon cancer). 

"Sample" or "biological sample" as used throughout here are generally 
meant to refer to samples of biological fluids or tissues, particularly samples obtained from
tissues, especially from cells of the type associated with the disease for which the
diagnostic application is designed (e.g., ductal adenocarcinoma), and the like. "Samples" is 
also meant to encompass derivatives and fractions of such samples (e.g., cell lysates). 
Where the sample is solid tissue, the cells of the tissue can be dissociated or tissue sections 
can be analyzed. 

Methods of the subject invention useful in diagnosis or prognosis typically
involve comparison of the abundance of a selected differentially expressed gene product in
a sample of interest with that of a control to determine any relative differences in the
expression of the gene product, where the difference can be measured qualitatively and/or
quantitatively. Quantitation can be accomplished, for example, by comparing the level of
expression product detected in the sample with the amounts of product present in a standard
curve. A comparison can be made visually; by using a technique such as densitometry,
with or without computerized assistance; by preparing a representative library of cDNA
clones of mRNA isolated from a test sample, sequencing the clones in the library to
determine the number of cDNA clones corresponding to the same gene product, and
analyzing the number of clones corresponding to that same gene product relative to the
number of clones of the same gene product in a control sample; or by using an array to
detect relative levels of hybridization to a selected sequence or set of sequences, and
comparing the hybridization pattern to that of a control. The differences in expression are
then correlated with the presence or absence of an abnormal expression pattern. A variety
of different methods for determining the nucleic acid abundance in a sample are known to
those of skill in the art (see, e.g., WO 97/27317). In general, diagnostic assays of the
invention involve detection of a gene product of a polynucleotide sequence (e.g.,
mRNA or polypeptide) that corresponds to a sequence of SEQ ID NOs: 1-3351. The patient
from whom the sample is obtained can be apparently healthy, susceptible to disease (e.g.,
as determined by family history or exposure to certain environmental factors), or can
already be identified as having a condition in which altered expression of a gene product of
the invention is implicated.
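
Quantitation against a standard curve, as mentioned above, can be sketched as a linear interpolation between the two standards that bracket the measured signal. This is only one of many possible approaches and is offered as an assumption-based illustration: the function name quantify_from_standard_curve is hypothetical, the standards are assumed to give signals that increase with amount, and no extrapolation beyond the curve is attempted.

```python
def quantify_from_standard_curve(standards, signal):
    """Estimate the amount of product from a measured signal.

    standards is a list of (known_amount, measured_signal) pairs.
    The amount is interpolated linearly between the two standards whose
    signals bracket the sample signal.
    """
    if len(standards) < 2:
        raise ValueError("at least two standards are required")
    points = sorted(standards, key=lambda point: point[1])
    if not points[0][1] <= signal <= points[-1][1]:
        raise ValueError("signal outside the range of the standard curve")
    for (amount0, signal0), (amount1, signal1) in zip(points, points[1:]):
        if signal0 <= signal <= signal1:
            if signal1 == signal0:
                return amount0  # flat segment: no interpolation possible
            fraction = (signal - signal0) / (signal1 - signal0)
            return amount0 + (amount1 - amount0) * fraction
```

The amounts estimated in this way for a test sample and for a control can then be compared to determine the relative difference in expression of the gene product.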

Diagnosis can be determined based on detected gene product expression 
levels of a gene product encoded by at least one, preferably at least two or more, at least 3
or more, or at least 4 or more of the polynucleotides having a sequence set forth in SEQ ID
NOs: 1-3351, and can involve detection of expression of genes corresponding to all of SEQ 
ID NOs: 1-3351 and/or additional sequences that can serve as additional diagnostic markers 
and/or reference sequences. Where the diagnostic method is designed to detect the 
presence or susceptibility of a patient to cancer, the assay preferably involves detection of a
gene product encoded by a gene corresponding to a polynucleotide that is differentially
expressed in cancer. Examples of such differentially expressed polynucleotides are 
described in the Examples below. Given the provided polynucleotides and information 
regarding their relative expression levels provided herein, assays using such 
polynucleotides and detection of their expression levels in diagnosis and prognosis will be
readily apparent to the ordinarily skilled artisan.

Any of a variety of detectable labels can be used in connection with the 
various embodiments of the diagnostic methods of the invention. Suitable detectable labels 
include fluorochromes (e.g., fluorescein isothiocyanate (FITC), rhodamine, Texas Red,
phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM),
2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein, 6-carboxy-X-rhodamine (ROX),
6-carboxy-2',4',7',4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or
N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA)), radioactive labels (e.g., 32P, 35S,
3H, etc.), and the like. The
detectable label can involve a two-stage system (e.g., biotin-avidin, hapten-anti-hapten
antibody, etc.).

Reagents specific for the polynucleotides and polypeptides of the invention,
such as antibodies and nucleotide probes, can be supplied in a kit for detecting the presence 
of an expression product in a biological sample. The kit can also contain buffers or 
labeling components, as well as instructions for using the reagents to detect and quantify
expression products in the biological sample. Exemplary embodiments of the diagnostic
methods of the invention are described below in more detail. 

Polypeptide detection in diagnosis. In one embodiment, the test sample is
assayed for the level of a differentially expressed polypeptide. Diagnosis can be 
accomplished using any of a number of methods to determine the absence or presence or
altered amounts of the differentially expressed polypeptide in the test sample. For
example, detection can utilize staining of cells or histological sections with labeled 
antibodies, performed in accordance with conventional methods. Cells can be 
permeabilized to stain cytoplasmic molecules. In general, antibodies that specifically bind 
a differentially expressed polypeptide of the invention are added to a sample, and incubated
for a period of time sufficient to allow binding to the epitope, usually at least about 10
minutes. The antibody can be detectably labeled for direct detection (e.g., using
radioisotopes, enzymes, fluorescers, chemiluminescers, and the like), or can be used in
conjunction with a second stage antibody or reagent to detect binding (e.g., biotin with
horseradish peroxidase-conjugated avidin, a secondary antibody conjugated to a fluorescent
compound, e.g., fluorescein, rhodamine, Texas red, etc.). The absence or presence of
antibody binding can be determined by various methods, including flow cytometry of 
dissociated cells, microscopy, radiography, scintillation counting, etc. Any suitable
alternative method of qualitative or quantitative detection of levels or amounts of
differentially expressed polypeptide can be used, for example ELISA, western blot,
immunoprecipitation, radioimmunoassay, etc.

mRNA detection. The diagnostic methods of the invention can also or
alternatively involve detection of mRNA encoded by a gene corresponding to a
differentially expressed polynucleotide of the invention. Any suitable qualitative or
quantitative methods known in the art for detecting specific mRNAs can be used. mRNA
can be detected by, for example, in situ hybridization in tissue sections, by reverse
transcriptase-PCR, or in Northern blots containing poly A+ mRNA. One of skill in the art 
can readily use these methods to determine differences in the size or amount of mRNA 
transcripts between two samples. mRNA expression levels in a sample can also be 
determined by generation of a library of expressed sequence tags (ESTs) from the sample,
where the EST library is representative of sequences present in the sample (Adams, et al.,
(1991) Science 252:1651). Enumeration of the relative representation of ESTs within the
library can be used to approximate the relative representation of the gene transcript within 
the starting sample. The results of EST analysis of a test sample can then be compared to 
EST analysis of a reference sample to determine the relative expression levels of a selected
polynucleotide, particularly a polynucleotide corresponding to one or more of the
differentially expressed genes described herein. Alternatively, gene expression in a test
sample can be analyzed using serial analysis of gene expression (SAGE) methodology
(e.g., Velculescu et al., Science (1995) 270:484) or differential display (DD) methodology
(see, e.g., U.S. Patent Nos. 5,776,683 and 5,807,680).
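
As a non-limiting sketch, the comparison of EST representation between a test library and a reference library described above might be computed as follows. The function names, the dictionary input format, and the gene labels are hypothetical and serve only to illustrate the arithmetic.

```python
def relative_representation(est_counts, gene):
    """Fraction of all ESTs in a library that correspond to the given gene."""
    total = sum(est_counts.values())
    if total == 0:
        raise ValueError("EST library is empty")
    return est_counts.get(gene, 0) / total


def relative_expression(test_counts, reference_counts, gene):
    """Ratio of the gene's representation in the test library to that in the
    reference library. A value above 1 suggests higher expression in the test
    sample, a value below 1 suggests lower expression."""
    ref_fraction = relative_representation(reference_counts, gene)
    if ref_fraction == 0:
        # The ratio is undefined when the gene is absent from the reference.
        raise ValueError("gene not represented in the reference library")
    return relative_representation(test_counts, gene) / ref_fraction


if __name__ == "__main__":
    test_library = {"geneA": 30, "geneB": 70}
    reference_library = {"geneA": 10, "geneB": 90}
    print(relative_expression(test_library, reference_library, "geneA"))
```

Dividing each count by the library total before taking the ratio keeps libraries of different sizes comparable.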

Alternatively, gene expression can be analyzed using hybridization analysis.
Oligonucleotides or cDNA can be used to selectively identify or capture DNA or RNA of
specific sequence composition, and the amount of RNA or cDNA hybridized to a known
capture sequence determined qualitatively or quantitatively, to provide information about
the relative representation of a particular message within the pool of cellular messages in a
sample. Hybridization analysis can be designed to allow for concurrent screening of the
relative expression of hundreds to thousands of genes by using, for example, array-based 
technologies having high density formats, including filters, microscope slides, or 
microchips, or solution-based technologies that use spectroscopic analysis (e.g., mass 
spectrometry). One exemplary use of arrays in the diagnostic methods of the invention is
described below in more detail.

Use of a single gene in diagnostic applications. The diagnostic methods of
the invention can focus on the expression of a single differentially expressed gene. For
example, the diagnostic method can involve detecting a differentially expressed gene, or a
polymorphism of such a gene (e.g., a polymorphism in a coding region or control region),
that is associated with disease. Disease-associated polymorphisms can include deletion or
truncation of the gene, mutations that alter expression level and/or affect activity of the 
encoded protein, etc. 

A number of methods are available for analyzing nucleic acids for the 
presence of a specific sequence, e.g., a disease-associated polymorphism. Where large
amounts of DNA are available, genomic DNA is used directly. Alternatively, the region of
interest is cloned into a suitable vector and grown in sufficient quantity for analysis. Cells 
that express a differentially expressed gene can be used as a source of mRNA, which can be 
assayed directly or reverse transcribed into cDNA for analysis. The nucleic acid can be 
amplified by conventional techniques, such as the polymerase chain reaction (PCR), to
provide sufficient amounts for analysis, and a detectable label can be included in the
amplification reaction (e.g., using a detectably labeled primer or detectably labeled
oligonucleotides) to facilitate detection. Alternatively, various methods are also known in
the art that utilize oligonucleotide ligation as a means of detecting polymorphisms; see, e.g.,
Riley et al., Nucl. Acids Res. (1990) 18:2887; and Delahunty et al., Am. J. Hum. Genet.
(1996) 58:1239.

The amplified or cloned sample nucleic acid can be analyzed by one of a 
number of methods known in the art. The nucleic acid can be sequenced by dideoxy or 
other methods, and the sequence of bases compared to a selected sequence, e.g., to a
wild-type sequence. Hybridization with the polymorphic or variant sequence can also be used to
determine its presence in a sample (e.g., by Southern blot, dot blot, etc.). The hybridization
pattern of a polymorphic or variant sequence and a control sequence to an array of 
oligonucleotide probes immobilized on a solid support, as described in U.S. Patent 
No. 5,445,934, or in WO 95/35505, can also be used as a means of identifying polymorphic 
or variant sequences associated with disease. Single strand conformational polymorphism
(SSCP) analysis, denaturing gradient gel electrophoresis (DGGE), and heteroduplex
analysis in gel matrices are used to detect conformational changes created by DNA 
sequence variation as alterations in electrophoretic mobility. Alternatively, where a 
polymorphism creates or destroys a recognition site for a restriction endonuclease, the 
sample is digested with that endonuclease, and the products size fractionated to determine
whether the fragment was digested. Fractionation is performed by gel or capillary
electrophoresis, particularly acrylamide or agarose gels. 
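
The restriction-site approach described above can be illustrated, under simplifying assumptions, by predicting the fragment sizes expected from a wild-type and a variant sequence. The sketch below assumes a linear molecule, searches one strand only (adequate for a palindromic site), and uses short made-up sequences with the well-known EcoRI site GAATTC, which is cut after the first G. The function name and sequences are illustrative and not part of the specification.

```python
def digest_fragment_lengths(sequence, site, cut_offset):
    """Return the fragment lengths produced by cutting a linear sequence at
    every occurrence of a recognition site.

    cut_offset is the number of bases into the site at which the cut occurs.
    """
    sequence = sequence.upper()
    site = site.upper()
    cut_positions = []
    start = sequence.find(site)
    while start != -1:
        cut_positions.append(start + cut_offset)
        start = sequence.find(site, start + 1)
    bounds = [0] + cut_positions + [len(sequence)]
    return [hi - lo for lo, hi in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    wild_type = "AAAAGAATTCTTTT"  # contains one GAATTC site
    variant = "AAAAGAGTTCTTTT"    # single base change destroys the site
    print(digest_fragment_lengths(wild_type, "GAATTC", 1))
    print(digest_fragment_lengths(variant, "GAATTC", 1))
```

A difference between the two predicted fragment patterns corresponds to the size difference that would be resolved by gel or capillary electrophoresis.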

Screening for mutations in a gene can be based on the functional or 
antigenic characteristics of the protein. Protein truncation assays are useful in detecting 
deletions that can affect the biological activity of the protein. Various immunoassays
designed to detect polymorphisms in proteins can be used in screening. Where many
diverse genetic mutations lead to a particular disease phenotype, functional protein assays 
have proven to be effective screening tools. The activity of the encoded protein can be 
determined by comparison with the wild-type protein. 

Pattern matching in diagnosis using arrays. In another embodiment, the
diagnostic and/or prognostic methods of the invention involve detection of expression of a
selected set of genes in a test sample to produce a test expression pattern (TEP). The TEP 
is compared to a reference expression pattern (REP), which is generated by detection of 
expression of the selected set of genes in a reference sample (e.g., a positive or negative 
control sample). The selected set of genes includes at least one of the genes of the
invention, which genes correspond to the polynucleotide sequences of SEQ ID
NOs: 1-3351. Of particular interest is a selected set of genes that includes genes
differentially expressed in the disease for which the test sample is to be screened. 

"Reference sequences" or "reference polynucleotides" as used herein in the 
context of differential gene expression analysis and diagnosis/prognosis refers to a selected
set of polynucleotides, which selected set includes at least one or more of the differentially
expressed polynucleotides described herein. A plurality of reference sequences, preferably
comprising positive and negative control sequences, can be included as reference
sequences. Additional suitable reference sequences are found in GenBank, UniGene, and
other nucleotide sequence databases (including, e.g., expressed sequence tag (EST), partial,
and full-length sequences).

"Reference array" means an array having reference sequences for use in
hybridization with a sample, where the reference sequences include all, at least one of, or
any subset of the differentially expressed polynucleotides described herein. Usually such
an array will include at least 3 different reference sequences, and can include any one or all
of the provided differentially expressed sequences. Arrays of interest can further comprise
sequences, including polymorphisms, of other genetic sequences, particularly other
sequences of interest for screening for a disease or disorder (e.g., cancer, dysplasia, or other
related or unrelated diseases, disorders, or conditions). The oligonucleotide sequence on
the array will usually be at least about 12 nt in length, and can be of about the length of the
provided sequences, or can extend into the flanking regions to generate fragments of 100 nt
to 200 nt in length or more. Reference arrays can be produced according to any suitable
methods known in the art. For example, methods of producing large arrays of
oligonucleotides are described in U.S. Patent Nos. 5,134,854 and 5,445,934 using
light-directed synthesis techniques. Using a computer controlled system, a heterogeneous array
of monomers is converted, through simultaneous coupling at a number of reaction sites,
into a heterogeneous array of polymers. Alternatively, microarrays are generated by
deposition of pre-synthesized oligonucleotides onto a solid substrate, for example as 
described in PCT published application no. WO 95/35505. 

A "reference expression pattern" or "REP" as used herein refers to the 
relative levels of expression of a selected set of genes, particularly of differentially
expressed genes, that is associated with a selected cell type, e.g., a normal cell, a cancerous
cell, a cell exposed to an environmental stimulus, and the like. A "test expression pattern"
or "TEP" refers to relative levels of expression of a selected set of genes, particularly of
differentially expressed genes, in a test sample (e.g., a cell of unknown or suspected disease
state, from which mRNA is isolated).

REPs can be generated in a variety of ways according to methods well
known in the art. For example, REPs can be generated by hybridizing a control sample to 
an array having a selected set of polynucleotides (particularly a selected set of differentially 
expressed polynucleotides), acquiring the hybridization data from the array, and storing the
data in a format that allows for ready comparison of the REP with a TEP. Alternatively, all
expressed sequences in a control sample can be isolated and sequenced, e.g., by isolating 
mRNA from a control sample, converting the mRNA into cDNA, and sequencing the 
cDNA. The resulting sequence information roughly or precisely reflects the identity and 
relative number of expressed sequences in the sample. The sequence information can then
be stored in a format (e.g., a computer-readable format) that allows for ready comparison of
the REP with a TEP. The REP can be normalized prior to or after data storage, and/or can
be processed to selectively remove sequences of expressed genes that are of less interest or
that might complicate analysis (e.g., some or all of the sequences associated with
housekeeping genes can be eliminated from REP data).
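
One possible way of preparing a stored REP along the lines just described is sketched below. The function name, the input format, and the use of "ACTB" as an example of a housekeeping gene to be excluded are assumptions for illustration. The sketch normalizes after the exclusion step, although the text permits normalization at other points.

```python
def build_expression_pattern(raw_levels, exclude=()):
    """Build a normalized expression pattern from raw per-gene values.

    raw_levels: mapping of gene name to raw count or signal.
    exclude: gene names (e.g., housekeeping genes) to drop before normalizing.
    """
    kept = {gene: value for gene, value in raw_levels.items()
            if gene not in exclude}
    total = sum(kept.values())
    if total <= 0:
        raise ValueError("no signal remains after exclusion")
    # Express each gene as a fraction of the retained total signal.
    return {gene: value / total for gene, value in kept.items()}


if __name__ == "__main__":
    raw = {"ACTB": 500, "geneA": 30, "geneB": 70}
    print(build_expression_pattern(raw, exclude={"ACTB"}))
```

The same function could be applied to a test sample so that the resulting TEP is stored on the same scale as the REP.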

TEPs can be generated in a manner similar to REPs, e.g., by hybridizing a
test sample to an array having a selected set of polynucleotides, particularly a selected set
of differentially expressed polynucleotides, acquiring the hybridization data from the array,
and storing the data in a format that allows for ready comparison of the TEP with a REP.
The REP and TEP to be used in a comparison can be generated simultaneously, or the TEP
can be compared to previously generated and stored REPs.

In one embodiment of the invention, comparison of a TEP with a REP 
involves hybridizing a test sample with a reference array, where the reference array has one 
or more reference sequences for use in hybridization with a sample. The reference 
sequences include all, at least one of, or any subset of the differentially expressed
polynucleotides described herein. Hybridization data for the test sample is acquired, the
data normalized, and the produced TEP compared with a REP generated using an array 
having the same or similar selected set of differentially expressed polynucleotides. Probes 
that correspond to sequences differentially expressed between the two samples will show 
decreased or increased hybridization efficiency for one of the samples relative to the other. 

Methods for collection of data from hybridization of samples with
reference arrays are well known in the art. For example, the polynucleotides of the
reference and test samples can be generated using a detectable fluorescent label, and
hybridization of the polynucleotides in the samples detected by scanning the microarrays
for the presence of the detectable label using, for example, a microscope and light source
for directing light at a substrate. A photon counter detects fluorescence from the substrate,
while an x-y translation stage varies the location of the substrate. A confocal detection
device that can be used in the subject methods is described in U.S. Patent No. 5,631,734. A 
scanning laser microscope is described in Shalon et al., Genome Res. (1996) 6:639. A scan, 
using the appropriate excitation line, is performed for each fluorophore used. The digital
images generated from the scan are then combined for subsequent analysis. For any
particular array element, the fluorescent signal from one sample (e.g., a test
sample) is compared to the fluorescent signal from another sample (e.g., a reference
sample), and the relative signal intensity determined.
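
For illustration, the per-element comparison of fluorescent signals can be expressed as a log ratio. The sketch below is one hypothetical way of doing so; the function name, the optional uniform background subtraction, and the choice of a base-2 logarithm are assumptions and are not prescribed by the specification.

```python
import math


def log2_signal_ratios(test_signal, reference_signal, background=0.0):
    """Return log2(test / reference) for each array element present in both
    scans. Elements whose background-corrected signal is not positive in
    either scan are skipped."""
    ratios = {}
    for element, test_value in test_signal.items():
        if element not in reference_signal:
            continue
        test_adj = test_value - background
        ref_adj = reference_signal[element] - background
        if test_adj <= 0 or ref_adj <= 0:
            continue
        ratios[element] = math.log2(test_adj / ref_adj)
    return ratios


if __name__ == "__main__":
    test_scan = {"spot1": 400.0, "spot2": 100.0, "spot3": 50.0}
    reference_scan = {"spot1": 100.0, "spot2": 100.0, "spot3": 200.0}
    print(log2_signal_ratios(test_scan, reference_scan))
```

On this scale a positive value indicates a stronger signal in the test sample, a negative value a stronger signal in the reference sample, and zero no difference.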

Methods for analyzing the data collected from hybridization to arrays are
well known in the art. For example, where detection of hybridization involves a
fluorescent label, data analysis can include the steps of determining fluorescent intensity as 
a function of substrate position from the data collected, removing outliers, i.e., data 
deviating from a predetermined statistical distribution, and calculating the relative binding 
affinity of the targets from the remaining data. The resulting data can be displayed as an
image with the intensity in each region varying according to the binding affinity between
targets and probes. 
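
The outlier-removal step described above does not specify a particular statistical criterion. As a sketch under that caveat, the example below assumes replicate intensity readings for one array element and discards readings that lie more than a chosen multiple of the median absolute deviation from the median before averaging. The criterion, the threshold, and the function name are assumptions of this example.

```python
import statistics


def robust_mean_intensity(intensities, max_deviations=3.0):
    """Average replicate intensities after discarding outliers.

    A reading is treated as an outlier when its distance from the median
    exceeds max_deviations times the median absolute deviation (MAD).
    """
    values = list(intensities)
    if not values:
        raise ValueError("no intensities supplied")
    median = statistics.median(values)
    mad = statistics.median([abs(v - median) for v in values])
    if mad == 0:
        # More than half the readings equal the median; use it directly.
        return float(median)
    kept = [v for v in values if abs(v - median) <= max_deviations * mad]
    return float(statistics.mean(kept))


if __name__ == "__main__":
    readings = [100, 102, 98, 101, 99, 500]
    print(robust_mean_intensity(readings))
```

In the usage example the reading of 500 is discarded and the remaining five readings are averaged.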

In general, the test sample is classified as having a gene expression profile 
corresponding to that associated with a disease or non-disease state by comparing the TEP 
generated from the test sample to one or more REPs generated from reference samples
(e.g., from samples associated with cancer or specific stages of cancer, dysplasia, samples
affected by a disease other than cancer, normal samples, etc.). The criteria for a match or a
substantial match between a TEP and a REP include expression of the same or substantially
the same set of reference genes, as well as expression of these reference genes at
substantially the same levels (e.g., no significant difference between the samples for a
signal associated with a selected reference sequence after normalization of the samples, or
at least no greater than about 25% to about 40% difference in signal strength for a given
reference sequence). In general, a pattern match between a TEP and a REP includes a match
in expression, preferably a match in qualitative or quantitative expression level, of at least 
one of, all or any subset of the differentially expressed genes of the invention. 
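
The match criteria set out above can be sketched in code as follows. The function name, the dictionary representation of a TEP and a REP, and the default tolerance of 25% (the lower end of the range given above) are assumptions of this example, which applies the strict form of the criteria, namely an identical gene set and a per-gene difference within the tolerance.

```python
def patterns_match(tep, rep, max_relative_difference=0.25):
    """Return True when a test expression pattern (TEP) matches a reference
    expression pattern (REP).

    Both patterns map gene names to normalized expression levels. A match
    requires the same set of genes and, for every gene, a relative difference
    from the reference level no greater than max_relative_difference.
    """
    if set(tep) != set(rep):
        return False
    for gene, ref_level in rep.items():
        test_level = tep[gene]
        if ref_level == 0:
            # With no reference signal, only an absent test signal matches.
            if test_level != 0:
                return False
            continue
        if abs(test_level - ref_level) / ref_level > max_relative_difference:
            return False
    return True


if __name__ == "__main__":
    rep = {"geneA": 1.0, "geneB": 2.0}
    print(patterns_match({"geneA": 1.2, "geneB": 1.8}, rep))
    print(patterns_match({"geneA": 1.5, "geneB": 2.0}, rep))
```

A "substantial match" in the sense of the text could be modeled by relaxing either condition, for example by raising the tolerance toward 40% or by requiring only a chosen subset of genes to agree.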

Pattern matching can be performed manually, or can be performed using a
computer program. Methods for preparation of substrate matrices (e.g., arrays), design of
oligonucleotides for use with such matrices, labeling of probes, hybridization conditions, 
scanning of hybridized matrices, and analysis of patterns generated, including comparison 
analysis, are described in, for example, U.S. Patent No. 5,800,992.

Diagnosis, Prognosis and Management of Cancer

The polynucleotides of the invention and their gene products are of 
particular interest as genetic or biochemical markers (e.g., in blood or tissues) that will
detect the earliest changes along the carcinogenesis pathway and/or monitor the efficacy
of various therapies and preventive interventions. For example, the level of expression of
certain polynucleotides can be indicative of a poorer prognosis, and therefore warrant more 
aggressive chemo- or radio-therapy for a patient or vice versa. The correlation of novel 
surrogate tumor specific features with response to treatment and outcome in patients can 
define prognostic indicators that allow the design of tailored therapy based on the
molecular profile of the tumor. These therapies include antibody targeting and gene
therapy. Determining expression of certain polynucleotides and comparison of a patient's
profile with known expression in normal tissue and variants of the disease allows a
determination of the best possible treatment for a patient, both in terms of specificity of
treatment and in terms of comfort level of the patient. Surrogate tumor markers, such as
polynucleotide expression, can also be used to better classify, and thus diagnose and treat,
different forms and disease states of cancer. Two classifications widely used in oncology 
that can benefit from identification of the expression levels of the polynucleotides of the 
invention are staging of the cancerous disorder, and grading the nature of the cancerous 
tissue. 

The polynucleotides of the invention can be useful to monitor patients
having or susceptible to cancer to detect potentially malignant events at a molecular level
before they are detectable at a gross morphological level. Furthermore, a polynucleotide of
the invention identified as important for one type of cancer can also have implications for
development or risk of development of other types of cancer, e.g., where a polynucleotide
is differentially expressed across various cancer types. Thus, for example, expression of a
polynucleotide that has clinical implications for metastatic colon cancer can also have 
clinical implications for stomach cancer or endometrial cancer. 

Staging. Staging is a process used by physicians to describe how advanced
the cancerous state is in a patient. Generally, if a cancer is only detectable in the area of the
primary lesion without having spread to any lymph nodes it is called Stage I. If it has
spread only to the closest lymph nodes, it is called Stage II. In Stage III, the cancer has 
generally spread to the lymph nodes in near proximity to the site of the primary lesion. 
Cancers that have spread to a distant part of the body, such as the liver, bone, brain or other 
site, are Stage IV, the most advanced stage.

The polynucleotides of the invention can facilitate fine-tuning of the staging
process by identifying markers for the aggressiveness of a cancer, e.g., the metastatic potential,
as well as the presence in different areas of the body. Thus, detection in a Stage II cancer of a
polynucleotide signifying high metastatic potential can be used to change a
borderline Stage II tumor to a Stage III tumor, justifying more aggressive therapy.
Conversely, the presence of a polynucleotide signifying a lower metastatic potential allows 
more conservative staging of a tumor. 

Grading of cancers. Grade is a term used to describe how closely a tumor
resembles normal tissue of its same type. The microscopic appearance of a tumor is used to
identify tumor grade based on parameters such as cell morphology, cellular organization,
and other markers of differentiation. As a general rule, the grade of a tumor corresponds to 
its rate of growth or aggressiveness, with undifferentiated or high-grade tumors being more 
aggressive than well differentiated or low-grade tumors. The following guidelines are 
generally used for grading tumors: 1) GX Grade cannot be assessed; 2) G1 Well
differentiated; 3) G2 Moderately well differentiated; 4) G3 Poorly differentiated; 5) G4
Undifferentiated. The polynucleotides of the invention can be especially valuable in
determining the grade of the tumor, as they not only can aid in determining the 
differentiation status of the cells of a tumor, they can also identify factors other than 
differentiation that are valuable in determining the aggressiveness of a tumor, such as
metastatic potential.

Detection of lung cancer. The polynucleotides of the invention can be used
to detect lung cancer in a subject. Although there are more than a dozen different kinds of 
lung cancer, the two main types of lung cancer are small cell and nonsmall cell, which 
encompass about 90% of all lung cancer cases. Small cell carcinoma (also called oat cell
carcinoma) usually starts in one of the larger bronchial tubes, grows fairly rapidly, and is
likely to be large by the time of diagnosis. Nonsmall cell lung cancer (NSCLC) is made up 
of three general subtypes of lung cancer. Epidermoid carcinoma (also called squamous cell 
carcinoma) usually starts in one of the larger bronchial tubes and grows relatively slowly. 
The size of these tumors can range from very small to quite large. Adenocarcinoma starts
growing near the outside surface of the lung and can vary in both size and growth rate.
Some slowly growing adenocarcinomas are described as alveolar cell cancer. Large cell 
carcinoma starts near the surface of the lung, grows rapidly, and the growth is usually fairly 
large when diagnosed. Other less common forms of lung cancer are carcinoid, cylindroma, 
mucoepidermoid, and malignant mesothelioma.

The polynucleotides of the invention, e.g., polynucleotides differentially
expressed in normal cells versus cancerous lung cells (e.g., tumor cells of high or low
metastatic potential) or between types of cancerous lung cells (e.g., high metastatic versus
low metastatic), can be used to distinguish types of lung cancer as well as to identify traits
specific to a certain patient's cancer and to select an appropriate therapy. For example, if
the patient's biopsy expresses a polynucleotide that is associated with a low metastatic 
potential, it may justify leaving a larger portion of the patient's lung in surgery to remove 
the lesion. Alternatively, a smaller lesion with expression of a polynucleotide that is 
associated with high metastatic potential may justify a more radical removal of lung tissue
and/or the surrounding lymph nodes, even if no metastasis can be identified through
pathological examination. 

Detection of breast cancer. The majority of breast cancers are
adenocarcinoma subtypes, which can be summarized as follows: 1) ductal carcinoma in
situ (DCIS), including comedocarcinoma; 2) infiltrating (or invasive) ductal carcinoma
(IDC); 3) lobular carcinoma in situ (LCIS); 4) infiltrating (or invasive) lobular carcinoma
(ILC); 5) inflammatory breast cancer; 6) medullary carcinoma; 7) mucinous carcinoma;
8) Paget's disease of the nipple; 9) Phyllodes tumor; and 10) tubular carcinoma.

The expression of polynucleotides of the invention can be used in the 
diagnosis and management of breast cancer, as well as to distinguish between types of
breast cancer. Breast cancer can be detected using expression levels of any
of the appropriate polynucleotides of the invention, either alone or in combination.
The aggressive nature and/or the metastatic potential of a breast cancer can
also be determined by comparing levels of one or more polynucleotides of the invention
with levels of another sequence known to vary in cancerous tissue, e.g., ER
expression. In addition, development of breast cancer can be detected by examining the
ratio of expression of a differentially expressed polynucleotide to the levels of steroid 
hormones (e.g., testosterone or estrogen) or to other hormones (e.g., growth hormone, 
insulin). Thus expression of specific marker polynucleotides can be used to discriminate 
between normal and cancerous breast tissue, to discriminate between breast cancers with
different cells of origin, to discriminate between breast cancers with different potential
metastatic rates, etc. 

Detection of colon cancer. The polynucleotides of the invention exhibiting
the appropriate expression pattern can be used to detect colon cancer in a subject. 
Colorectal cancer is one of the most common neoplasms in humans and perhaps the most
frequent form of hereditary neoplasia. Prevention and early detection are key factors in
controlling and curing colorectal cancer. Colorectal cancer begins as polyps, which are
small, benign growths of cells that form on the inner lining of the colon. Over a period of 
several years, some of these polyps accumulate additional mutations and become 
cancerous. Multiple familial colorectal cancer disorders have been identified, which are
summarized as follows: 1) Familial adenomatous polyposis (FAP); 2) Gardner's syndrome;
3) Hereditary nonpolyposis colon cancer (HNPCC); and 4) Familial colorectal cancer in 
Ashkenazi Jews. The expression of appropriate polynucleotides of the invention can be 
used in the diagnosis, prognosis and management of colorectal cancer. Detection of colon 
cancer can be determined using expression levels of any of these sequences alone or in
combination with the levels of expression. The aggressive nature and/or
the metastatic potential of a colon cancer can be determined by comparing levels of one or
more polynucleotides of the invention with total levels of another sequence
known to vary in cancerous tissue, e.g., expression of p53, DCC, ras, or FAP (see, e.g.,
Fearon ER, et al., Cell (1990) 61(5):759; Hamilton SR et al., Cancer (1993) 72:957;
Bodmer W, et al., Nat Genet. (1994) 4(5):217; Fearon ER, Ann N Y Acad Sci. (1995)
768:101). For example, development of colon cancer can be detected by examining the
ratio of any of the polynucleotides of the invention to the levels of oncogenes (e.g., ras) or 
tumor suppressor genes (e.g., FAP or p53). Thus expression of specific marker 
polynucleotides can be used to discriminate between normal and cancerous colon tissue, to
discriminate between colon cancers with different cells of origin, to discriminate between
colon cancers with different potential metastatic rates, etc. 

Use of Polynucleotides to Screen for Peptide Analogs and Antagonists 

Polypeptides encoded by the instant polynucleotides and corresponding full-length
genes can be used to screen peptide libraries to identify binding partners, such as
receptors, from among the encoded polypeptides. Peptide libraries can be synthesized
according to methods known in the art (see, e.g., U.S. Patent No. 5,010,175, and WO 
91/17823). Agonists or antagonists of the polypeptides of the invention can be screened
using any available method known in the art, such as signal transduction, antibody binding, 
receptor binding, mitogenic assays, chemotaxis assays, etc. The assay conditions ideally
should resemble the conditions under which the native activity is exhibited in vivo, that is,
under physiologic pH, temperature, and ionic strength. Suitable agonists or antagonists 
will exhibit strong inhibition or enhancement of the native activity at concentrations that do 
not cause toxic side effects in the subject. Agonists or antagonists that compete for binding
to the native polypeptide can require concentrations equal to or greater than the native
concentration, while inhibitors capable of binding irreversibly to the polypeptide can be 
added in concentrations on the order of the native concentration. 

Such screening and experimentation can lead to identification of a novel
polypeptide binding partner, such as a receptor, encoded by a gene or a cDNA
corresponding to a polynucleotide of the invention, and at least one peptide agonist or 
antagonist of the novel binding partner. Such agonists and antagonists can be used to 
modulate, enhance, or inhibit receptor function in cells to which the receptor is native, or in 
cells that possess the receptor as a result of genetic engineering. Further, if the novel 
10 receptor shares biologically important characteristics with a known receptor, information 
about agonist/antagonist binding can facilitate development of improved 
agonists/antagonists of the known receptor. 

Pharmaceutical Compositions and Therapeutic Uses 

Pharmaceutical compositions of the invention can comprise polypeptides,
antibodies, or polynucleotides (including antisense nucleotides and ribozymes) of the
claimed invention in a therapeutically effective amount. The term "therapeutically effective
amount" as used herein refers to an amount of a therapeutic agent sufficient to treat,
ameliorate, or prevent a desired disease or condition, or to exhibit a detectable therapeutic
or preventative effect. The effect can be detected by, for example, chemical markers or
antigen levels. Therapeutic effects also include reduction in physical symptoms, such as
decreased body temperature. The precise effective amount for a subject will depend upon
the subject's size and health, the nature and extent of the condition, and the therapeutics or
combination of therapeutics selected for administration. Thus, it is not useful to specify an
exact effective amount in advance. However, the effective amount for a given situation is
determined by routine experimentation and is within the judgment of the clinician. For
purposes of the present invention, an effective dose will generally be from about 0.01 mg/kg
to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA constructs in the individual to
which it is administered.

A pharmaceutical composition can also contain a pharmaceutically
acceptable carrier. The term "pharmaceutically acceptable carrier" refers to a carrier for
administration of a therapeutic agent, such as antibodies or a polypeptide, genes, and other
therapeutic agents. The term refers to any pharmaceutical carrier that does not itself induce
the production of antibodies harmful to the individual receiving the composition, and which
can be administered without undue toxicity. Suitable carriers can be large, slowly
metabolized macromolecules such as proteins, polysaccharides, polylactic acids,
polyglycolic acids, polymeric amino acids, amino acid copolymers, and inactive virus
particles. Such carriers are well known to those of ordinary skill in the art.
Pharmaceutically acceptable carriers in therapeutic compositions can include liquids such
as water, saline, glycerol and ethanol. Auxiliary substances, such as wetting or emulsifying
agents, pH buffering substances, and the like, can also be present in such vehicles.
Typically, the therapeutic compositions are prepared as injectables, either as liquid
solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid
vehicles prior to injection can also be prepared. Liposomes are included within the
definition of a pharmaceutically acceptable carrier. Pharmaceutically acceptable salts can
also be present in the pharmaceutical composition, e.g., mineral acid salts such as
hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of organic
acids such as acetates, propionates, malonates, benzoates, and the like. A thorough
discussion of pharmaceutically acceptable excipients is available in Remington's
Pharmaceutical Sciences (Mack Pub. Co., New Jersey, 1991).

Delivery Methods. Once formulated, the compositions of the invention can
be (1) administered directly to the subject (e.g., as polynucleotides or polypeptides); or
(2) delivered ex vivo, to cells derived from the subject (e.g., as in ex vivo gene therapy).
Direct delivery of the compositions will generally be accomplished by parenteral injection,
e.g., subcutaneously, intraperitoneally, intravenously or intramuscularly, intratumorally or to
the interstitial space of a tissue. Other modes of administration include oral and pulmonary
administration, suppositories, and transdermal applications, needles, and gene guns or
hyposprays. Dosage treatment can be a single dose schedule or a multiple dose schedule.

Methods for the ex vivo delivery and reimplantation of transformed cells
into a subject are known in the art and described in, e.g., International Publication No. WO
93/14778. Examples of cells useful in ex vivo applications include, for example, stem cells,
particularly hematopoietic, lymph cells, macrophages, dendritic cells, or tumor cells.
Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be
accomplished by, for example, dextran-mediated transfection, calcium phosphate
precipitation, polybrene-mediated transfection, protoplast fusion, electroporation,
encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA
into nuclei, all well known in the art.

Once a gene corresponding to a polynucleotide of the invention has been
found to correlate with a proliferative disorder, such as neoplasia, dysplasia, and
hyperplasia, the disorder can be amenable to treatment by administration of a therapeutic
agent based on the provided polynucleotide, corresponding polypeptide or other
corresponding molecule (e.g., antisense, ribozyme, etc.).

The dose and the means of administration of the inventive pharmaceutical
compositions are determined based on the specific qualities of the therapeutic composition,
the condition, age, and weight of the patient, the progression of the disease, and other
relevant factors. For example, administration of polynucleotide therapeutic compositions
of the invention includes local or systemic administration, including injection, oral
administration, particle gun or catheterized administration, and topical administration.
Preferably, the therapeutic polynucleotide composition contains an expression construct
comprising a promoter operably linked to a polynucleotide of at least 12, 22, 25, 30, or 35
contiguous nt of the polynucleotide disclosed herein. Various methods can be used to
administer the therapeutic composition directly to a specific site in the body. For example,
a small metastatic lesion is located and the therapeutic composition injected several times
in several different locations within the body of the tumor. Alternatively, arteries which serve
a tumor are identified, and the therapeutic composition injected into such an artery, in order
to deliver the composition directly into the tumor. A tumor that has a necrotic center is
aspirated and the composition injected directly into the now empty center of the tumor.
The antisense composition is directly administered to the surface of the tumor, for example,
by topical application of the composition. X-ray imaging is used to assist in certain of the
above delivery methods.

Receptor-mediated targeted delivery of therapeutic compositions containing
an antisense polynucleotide, subgenomic polynucleotides, or antibodies to specific tissues
can also be used. Receptor-mediated DNA delivery techniques are described in, for
example, Findeis et al., Trends Biotechnol. (1993) 11:202; Chiou et al., Gene Therapeutics:
Methods And Applications Of Direct Gene Transfer (J.A. Wolff, ed.) (1994); Wu et al., J.
Biol. Chem. (1988) 263:621; Wu et al., J. Biol. Chem. (1994) 269:542; Zenke et al., Proc.
Natl. Acad. Sci. (USA) (1990) 87:3655; Wu et al., J. Biol. Chem. (1991) 266:338.
Therapeutic compositions containing a polynucleotide are administered in a range of about
100 ng to about 200 mg of DNA for local administration in a gene therapy protocol.
Concentration ranges of about 500 ng to about 50 mg, about 1 mg to about 2 mg, about 5
mg to about 500 mg, and about 20 mg to about 100 mg of DNA can also be used during a
gene therapy protocol. Factors such as method of action (e.g., for enhancing or inhibiting
levels of the encoded gene product) and efficacy of transformation and expression are
considerations which will affect the dosage required for ultimate efficacy of the antisense
subgenomic polynucleotides. Where greater expression is desired over a larger area of
tissue, larger amounts of antisense subgenomic polynucleotides or the same amounts
readministered in a successive protocol of administrations, or several administrations to
different adjacent or close tissue portions of, for example, a tumor site, may be required to
effect a positive therapeutic outcome. In all cases, routine experimentation in clinical trials
will determine specific ranges for optimal therapeutic effect. For polynucleotide-related
genes encoding polypeptides or proteins with anti-inflammatory activity, suitable use,
doses, and administration are described in U.S. Patent No. 5,654,173.

The therapeutic polynucleotides and polypeptides of the present invention
can be delivered using gene delivery vehicles. The gene delivery vehicle can be of viral or
non-viral origin (see generally, Jolly, Cancer Gene Therapy (1994) 1:51; Kimura, Human
Gene Therapy (1994) 5:845; Connelly, Human Gene Therapy (1995) 1:185; and Kaplitt,
Nature Genetics (1994) 6:148). Expression of such coding sequences can be induced using
endogenous mammalian or heterologous promoters. Expression of the coding sequence
can be either constitutive or regulated.

Viral-based vectors for delivery of a desired polynucleotide and expression
in a desired cell are well known in the art. Exemplary viral-based vehicles include, but are
not limited to, recombinant retroviruses (see, e.g., WO 90/07936; WO 94/03622; WO
93/25698; WO 93/25234; U.S. Patent No. 5,219,740; WO 93/11230; WO 93/10218; U.S.
Patent No. 4,777,127; GB Patent No. 2,200,651; EP 0 345 242; and WO 91/02805),
alphavirus-based vectors (e.g., Sindbis virus vectors, Semliki forest virus (ATCC VR-67;
ATCC VR-1247), Ross River virus (ATCC VR-373; ATCC VR-1246) and Venezuelan
equine encephalitis virus (ATCC VR-923; ATCC VR-1250; ATCC VR-1249; ATCC
VR-532)), and adeno-associated virus (AAV) vectors (see, e.g., WO 94/12649, WO 93/03769;
WO 93/19191; WO 94/28938; WO 95/11984 and WO 95/00655). Administration of DNA
linked to killed adenovirus as described in Curiel, Hum. Gene Ther. (1992) 3:147 can also
be employed.

Non-viral delivery vehicles and methods can also be employed, including,
but not limited to, polycationic condensed DNA linked or unlinked to killed adenovirus
alone (see, e.g., Curiel, Hum. Gene Ther. (1992) 3:147); ligand-linked DNA (see, e.g., Wu,
J. Biol. Chem. 264:16985 (1989)); eukaryotic cell delivery vehicles (see, e.g., U.S.
Patent No. 5,814,482; WO 95/07994; WO 96/17072; WO 95/30763; and WO 97/42338)
and nucleic charge neutralization or fusion with cell membranes. Naked DNA can also be
employed. Exemplary naked DNA introduction methods are described in WO 90/11092
and U.S. Patent No. 5,580,859. Liposomes that can act as gene delivery vehicles are
described in U.S. Patent No. 5,422,120; WO 95/13796; WO 94/23697; WO 91/14445; and
EP 0524968. Additional approaches are described in Philip, Mol. Cell Biol. 14:2411
(1994), and in Woffendin, Proc. Natl. Acad. Sci. (1994) 91:1581.

Further non-viral delivery suitable for use includes mechanical delivery
systems such as the approach described in Woffendin et al., Proc. Natl. Acad. Sci. USA
91(24):11581 (1994). Moreover, the coding sequence and the product of expression of
such can be delivered through deposition of photopolymerized hydrogel materials or use of
ionizing radiation (see, e.g., U.S. Patent No. 5,206,152 and WO 92/11033). Other
conventional methods for gene delivery that can be used for delivery of the coding
sequence include, for example, use of a hand-held gene transfer particle gun (see, e.g., U.S.
Patent No. 5,149,655) and use of ionizing radiation for activating a transferred gene (see, e.g.,
U.S. Patent No. 5,206,152 and WO 92/11033).

The present invention will now be illustrated by reference to the following
examples which set forth particularly advantageous embodiments. However, it should be
noted that these embodiments are illustrative and are not to be construed as restricting the
invention in any way.

EXAMPLES
EXAMPLE 1 

Source of Biological Materials and Overview of Novel Polynucleotides 
Expressed by the Biological Materials 


Cell lines and human normal and tumor tissue were used to construct cDNA
libraries from mRNA isolated from the cells and tissues. Most sequences were about
275-300 nucleotides in length. The cell lines include the KM12L4-A cell line, a high metastatic
colon cancer cell line (Morikawa, K. et al., Cancer Research (1988) 48:6863). The
KM12L4-A cell line is derived from the KM12C cell line. The KM12C cell line, which is
poorly metastatic (low metastatic), was established in culture from a Dukes' stage B2
surgical specimen (Morikawa et al., Cancer Res. (1988) 48:6863). The KM12L4-A is a highly
metastatic subline derived from KM12C (Yeatman et al., Nucl. Acids Res. (1995) 23:4007;
Bao-Ling et al., Proc. Annu. Meet. Am. Assoc. Cancer Res. (1995) 27:3269). The KM12C
and KM12C-derived cell lines (e.g., KM12L4, KM12L4-A, etc.) are well-recognized in the
art as model cell lines for the study of colon cancer (see, e.g., Morikawa et al., supra;
Radinsky et al., Clin. Cancer Res. (1995) 1:19; Yeatman et al., (1995) supra; Yeatman et
al., Clin. Exp. Metastasis (1996) 14:246). These and other cell lines and tissue are
described in Table 6.

The sequences of the isolated polynucleotides were first masked to eliminate
low complexity sequences using the XBLAST masking program (Claverie "Effective
Large-Scale Sequence Similarity Searches," In: Computer Methods for Macromolecular
Sequence Analysis, Doolittle, ed., Meth. Enzymol. 266:212-227 Academic Press, NY, NY
(1996); see particularly Claverie, in "Automated DNA Sequencing and Analysis
Techniques" Adams et al., eds., Chap. 36, p. 267 Academic Press, San Diego, 1994 and
Claverie et al., Comput. Chem. (1993) 17:191). Generally, masking does not influence the
final search results, except to eliminate sequences of relatively little interest due to their low
complexity, and to eliminate multiple "hits" based on similarity to repetitive regions
common to multiple sequences, e.g., Alu repeats. The sequences remaining after masking
were then used in a BLASTN vs. Genbank search; sequences that exhibited greater than
70% overlap, 99% identity, and a p value of less than 1 x 10^-40 were discarded. Sequences
from this search also were discarded if the inclusive parameters were met, but the sequence
was ribosomal or vector-derived.
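The discard rule applied to the BLASTN vs. Genbank results can be sketched as follows. This is a minimal illustration and not the software actually used: the `Hit` record, its field names, and the function `discard_after_genbank` are hypothetical, the example values are invented, and the sketch assumes that "99% identity" means at least 99%. The handling of ribosomal and vector-derived sequences is not modeled.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """Best database hit for one masked query sequence (hypothetical record)."""
    overlap_pct: float   # percent of the query covered by the alignment
    identity_pct: float  # percent identity within the alignment
    p_value: float       # BLAST p value of the hit


def discard_after_genbank(hit, min_overlap=70.0, min_identity=99.0, max_p=1e-40):
    """Return True when a hit meets all three discard thresholds."""
    return (hit.overlap_pct > min_overlap
            and hit.identity_pct >= min_identity  # assumption: "99%" read as >= 99
            and hit.p_value < max_p)


# Invented example hits, keyed by an arbitrary sequence name.
hits = {
    "seqA": Hit(85.0, 99.6, 1e-52),  # meets every threshold, so discarded
    "seqB": Hit(40.0, 99.9, 1e-45),  # overlap too low, so retained
    "seqC": Hit(90.0, 92.0, 1e-60),  # identity too low, so retained
}
retained = [name for name, hit in hits.items() if not discard_after_genbank(hit)]
print(retained)
```

All three conditions must hold together for a sequence to be discarded, so a sequence that is nearly identical to a database entry over only a short stretch is retained.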






The resulting sequences from the previous search were classified into three
groups (1, 2 and 3 below) and searched in a BLASTX vs. NRP (non-redundant proteins)
database search: (1) unknown (no hits in the Genbank search), (2) weak similarity (greater
than 45% identity and p value of less than 1 x 10^-5), and (3) high similarity (greater than
60% overlap, greater than 80% identity, and p value less than 1 x 10^-5). Sequences having
greater than 70% overlap, greater than 99% identity, and p value of less than 1 x 10^-40 were
discarded.
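The three groups and the discard rule above can be expressed as a small decision function. The sketch below is illustrative only: the function name, the tuple layout, and the order in which the rules are tested (discard first, then high, then weak) are assumptions, and the "unclassified" label for hits that fall below the weak-similarity thresholds is a placeholder, since the text does not say how such hits were treated.

```python
def classify_hit(hit):
    """Classify one search result.

    `hit` is None when the search returned nothing, otherwise a tuple of
    (overlap percent, identity percent, p value).
    """
    if hit is None:
        return "unknown"
    overlap, identity, p_value = hit
    if overlap > 70 and identity > 99 and p_value < 1e-40:
        return "discard"
    if overlap > 60 and identity > 80 and p_value < 1e-5:
        return "high similarity"
    if identity > 45 and p_value < 1e-5:
        return "weak similarity"
    return "unclassified"  # placeholder label, not defined in the text


# Invented example values.
for example in (None, (50, 60, 1e-8), (75, 90, 1e-20), (95, 99.5, 1e-50)):
    print(example, "->", classify_hit(example))
```

Testing the strictest rule first matters because any hit that satisfies the discard thresholds also satisfies the high-similarity thresholds.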

The remaining sequences were classified as unknown (no hits), weak
similarity, and high similarity (parameters as above). Two searches were performed on
these sequences. First, a BLAST vs. EST database search was performed and sequences
with greater than 99% overlap, greater than 99% similarity and a p value of less than 1 x
10^-40 were discarded. Sequences with a p value of less than 1 x 10^-65 when compared to a
database sequence of human origin were also excluded. Second, a BLASTN vs. Patent
GeneSeq database search was performed and sequences having greater than 99% identity,
p value less than 1 x 10^-40, and greater than 99% overlap were discarded.

The remaining sequences were subjected to screening using other rules and 
redundancies in the dataset. Sequences with a p value of less than 1 x 10" m in relation to a 
database sequence of human origin were specifically excluded. The final result provided
the 3351 sequences listed in the accompanying Sequence Listing. Each identified
polynucleotide represents sequence from at least a partial mRNA transcript.
Polynucleotides that were determined to be novel were assigned a sequence identification
number.

The novel polynucleotides were assigned sequence identification numbers SEQ
ID NOs: 1-3351. The first 1847 DNA sequences corresponding to the novel polynucleotides
are provided in the Sequence Listing in Table 1. DNA sequences corresponding to the novel
polynucleotides of SEQ ID NOs: 1848-3351 are provided in the Sequence Listing in Table 2.
The DNA sequences of Table 2, while numbered SEQ ID 1-1504, correspond to SEQ ID
NOs: 1848-3351 in the Sequence Listing, e.g., Table 2 SEQ ID 1 is SEQ ID NO: 1848, Table 2
SEQ ID 2 is SEQ ID NO: 1849, etc. Each DNA sequence in Table 4 is uniquely identified by a
number that is 1847 less than its SEQ ID NO in the Sequence Listing. Tables 1 and 2 provide:
1) the SEQ ID NO assigned to each sequence for use in the present specification or a
corresponding number; 2) the sequence name used as an internal identifier of the sequence;
3) the name assigned to the clone from which the sequence was isolated; and 4) the number of
the cluster to which the sequence is assigned (Cluster ID; where the cluster ID is 0, the
sequence was not assigned to any cluster).

Because the provided polynucleotides represent partial mRNA transcripts,
two or more polynucleotides of the invention may represent different regions of the same
mRNA transcript and the same gene. Thus, if two or more SEQ ID NOs are identified as
belonging to the same clone, then any of those sequences can be used to obtain the full-length
mRNA or gene.
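The numbering used in Tables 2 and 4 differs from the Sequence Listing by a fixed offset of 1847. As a worked illustration of that arithmetic, the following sketch converts between the two numbering schemes; the function names are hypothetical and are not part of the specification.

```python
TABLE_OFFSET = 1847        # number of sequences listed before SEQ ID NO: 1848
TABLE_ENTRY_COUNT = 1504   # entries numbered 1-1504 in Tables 2 and 4


def seq_id_no_from_table_number(table_number):
    """Convert a Table 2 or Table 4 number (1-1504) to its SEQ ID NO (1848-3351)."""
    if not 1 <= table_number <= TABLE_ENTRY_COUNT:
        raise ValueError("table number must be between 1 and 1504")
    return table_number + TABLE_OFFSET


def table_number_from_seq_id_no(seq_id_no):
    """Convert a SEQ ID NO (1848-3351) to its Table 2 or Table 4 number (1-1504)."""
    if not TABLE_OFFSET < seq_id_no <= TABLE_OFFSET + TABLE_ENTRY_COUNT:
        raise ValueError("SEQ ID NO must be between 1848 and 3351")
    return seq_id_no - TABLE_OFFSET


print(seq_id_no_from_table_number(1))     # Table 2 SEQ ID 1 is SEQ ID NO: 1848
print(table_number_from_seq_id_no(3351))  # the last entry is numbered 1504
```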

EXAMPLE 2 

Results of Public Database Search to Identify Function of Gene Products 



SEQ ID NOs: 1-3351 were translated in all three reading frames to determine
the best alignment with the individual sequences. These amino acid sequences and
nucleotide sequences are referred to, generally, as query sequences, which are aligned with
the individual sequences. Query and individual sequences were aligned using the BLAST
programs, available over the world wide web at http://www.ncbi.nlm.nih.gov/BLAST/.
Again the sequences were masked to various extents to prevent searching of repetitive
sequences or poly-A sequences, using the XBLAST program for masking low complexity
as described above in Example 1.
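Translation of a nucleotide sequence in three forward reading frames can be illustrated with a short sketch. It assumes the standard genetic code and is not the BLAST implementation; the function names are hypothetical, unrecognized codons are shown as "X", and stop codons as "*".

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate_frame(sequence, frame):
    """Translate `sequence` starting at offset `frame` (0, 1, or 2)."""
    sequence = sequence.upper().replace("U", "T")
    codons = (sequence[i:i + 3] for i in range(frame, len(sequence) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def three_frame_translation(sequence):
    """Return the translations of the three forward reading frames."""
    return [translate_frame(sequence, frame) for frame in range(3)]


print(three_frame_translation("ATGGCCTAA"))
```

Each frame yields a different candidate amino acid sequence, and the frame giving the best alignment to a database protein would be taken as the likely coding frame.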

Tables 3 and 4 (inserted before the claims) show the results of the alignments.
Table 3 contains alignment information for SEQ ID NOs: 1-1847 and Table 4 contains
alignment information for SEQ ID NOs: 1848-3351. The DNA sequences of Table 4, while
numbered SEQ ID 1-1504, correspond to SEQ ID NOs: 1848-3351. Each DNA sequence in
Table 4 is uniquely identified by a number that is 1847 less than its SEQ ID NO. Tables 3 and
4 refer to each sequence by its SEQ ID NO or a corresponding number, the accession numbers
and descriptions of nearest neighbors from the Genbank and Non-Redundant Protein searches,
and the p values of the search results.

For each of SEQ ID NOs: 1-1847, the best alignment to a protein or DNA
sequence is included in Table 3, and the best alignment for each of SEQ ID NOs: 1848-3351 is
included in Table 4. The activity of the polypeptide encoded by SEQ ID NOs: 1-3351 is the
same or similar to the nearest neighbor reported in Table 3 or 4. The accession number of the
nearest neighbor is reported, providing a reference to the activities exhibited by the nearest
neighbor. The search program and database used for the alignment also are indicated as well
as a calculation of the p value.

Full length sequences or fragments of the polynucleotide sequences of the
nearest neighbors can be used as probes and primers to identify and isolate the full length
sequence of SEQ ID NOs: 1-3351. The nearest neighbors can indicate a tissue or cell type
to be used to construct a library for the full-length sequences of SEQ ID NOs: 1-3351.

EXAMPLE 3 
Members of Protein Families 


The sequences (SEQ ID NOs: 1-3351) were used to conduct a profile search
as described in the specification above. Several of the polynucleotides of the invention
were found to encode polypeptides having characteristics of a polypeptide belonging to a
known protein family (and thus represent new members of these protein families) and/or
comprising a known functional domain (Table 5). "Start" and "stop" in Table 3 indicate the
position within the individual sequences that align with the query sequence having the
indicated SEQ ID NO. The direction indicates the orientation of the query sequence with
respect to the individual sequence, where forward (for) indicates that the alignment is in the
same direction (left to right) as the sequence provided in the Sequence Listing and reverse
(rev) indicates that the alignment is with a sequence complementary to the sequence
provided in the Sequence Listing.
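The meaning of the "for" and "rev" directions can be illustrated with a short sketch that returns the strand on which an alignment was found. The function name is hypothetical, and the sketch assumes that "complementary" means the reverse complement read 5' to 3', which is the usual convention.

```python
# Translation table mapping each DNA base to its complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def aligned_strand(sequence, direction):
    """Return the strand that was aligned, given a "for" or "rev" direction."""
    if direction == "for":
        return sequence  # same orientation as the Sequence Listing
    if direction == "rev":
        return sequence.translate(_COMPLEMENT)[::-1]  # reverse complement
    raise ValueError('direction must be "for" or "rev"')


print(aligned_strand("ATGC", "for"))
print(aligned_strand("ATGC", "rev"))
```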

Some polynucleotides exhibited multiple profile hits because, for example, 
the particular sequence contains overlapping profile regions, and/or the sequence contains 
two different functional domains. These profile hits are described in more detail below. 

Ank Repeats (ANK). SEQ ID NOs: 187, 1268, 1804, 1819, 1830, 1839,
2652, 3015 and 3267 represent polynucleotides encoding an Ank repeat-containing protein.
The ankyrin motif is a 33 amino acid sequence named for the protein ankyrin, which has 24
tandem 33-amino-acid motifs. Ank repeats were originally identified in the
cell-cycle-control protein cdc10 (Breeden et al., Nature (1987) 329:651). Proteins containing
ankyrin repeats include ankyrin, myotropin, I-kappaB proteins, cell cycle protein cdc10, the
Notch receptor (Matsuno et al., Development (1997) 124(21):4265); G9a (or BAT8) of the class
III region of the major histocompatibility complex (Biochem J. 290:811-818, 1993), FABP,
GABP, 53BP2, Lin12, glp-1, SWI4, and SWI6. The functions of the ankyrin repeats are
compatible with a role in protein-protein interactions (Bork, Proteins (1993) 17(4):363;
Lambert and Bennet, Eur. J. Biochem. (1993) 211:1; Kerr et al., Current Op. Cell Biol.
(1992) 4:496; Bennet et al., J. Biol. Chem. (1980) 255:6424).

ATPases Associated with Various Cellular Activities (ATPases). Sequences
within SEQ ID NOs: 431, 639, 2135, 2684, 2859, 3197 and 3266 correspond to a sequence
that encodes a novel member of the "ATPases Associated with diverse cellular Activities"
(AAA) protein family. The AAA protein family is composed of a large number of
ATPases that share a conserved region of about 220 amino acids that contains an
ATP-binding site (Froehlich et al., J. Cell Biol. (1991) 114:443; Erdmann et al., Cell (1991)
64:499; Peters et al., EMBO J. (1990) 9:1757; Kunau et al., Biochimie (1993) 75:209-224;
Confalonieri et al., BioEssays (1995) 17:639;
http://yeamob.pci.chemie.uni-tuebingen.de/AAA/Description.html). The proteins that belong
to this family contain either one or two AAA domains. In general, the AAA domains in these
proteins act as ATP-dependent protein clamps (Confalonieri et al. (1995) BioEssays 17:639).
In addition to the ATP-binding 'A' and 'B' motifs, which are located in the N-terminal half of
this domain, there is a highly conserved region located in the central part of the domain which
was used in the development of the signature pattern. The consensus pattern is: [LIVMT]-x-
[LIVMT]-[LIVMF]-x-[GATMC]-[ST]-[NS]-x(4)-[LIVM]-D-x-A-[LIFA]-x-R.

Bromodomain (bromodomain). SEQ ID NO: 1814 represents a
polynucleotide encoding a polypeptide having a bromodomain region (Haynes et al., 1992,
Nucleic Acids Res. 20:2693-2603, Tamkun et al., 1992, Cell 68:561-572, and Tamkun,
1995, Curr. Opin. Genet. Dev. 5:473-477), which is a conserved region of about 70 amino
acids. The bromodomain is thought to be involved in protein-protein interactions and may
be important for the assembly or activity of multicomponent complexes involved in
transcriptional activation. The consensus pattern, which spans a major part of the
bromodomain, is: [STANVF]-x(2)-F-x(4)-[DNS]-x(57
[LIVMFY]-x(3HLIVM]-x(4)-[U^ 
[FY]. 

Basic Region Plus Leucine Zipper Transcription Factors (BZIP). SEQ ID
NOs: 410, 552, 768, 822, 836, 1288, 1365, 1454, 1540, 1549, 1556, 1557, 1563, 1622,
1630, 1704, 1808, 2363, 2424, 3147, 3152, 3158 and 3208 represent polynucleotides
encoding a novel member of the family of basic region plus leucine zipper transcription
factors. The bZIP superfamily (Hurst, Protein Prof. (1995) 2:105; and Ellenberger, Curr.
Opin. Struct. Biol. (1994) 4:12) of eukaryotic DNA-binding transcription factors
encompasses proteins that contain a basic region mediating sequence-specific
DNA-binding followed by a leucine zipper required for dimerization. The consensus pattern
for this protein family is: [KR]-x(1,3)-[RKSAQ]-N-x(2)-[SAQ](2)-x-[RKTAENQ]-x-R-x-
[RK].
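Consensus patterns of this kind are written in a PROSITE-style notation, in which "x" is any residue, square brackets list allowed residues, curly braces list excluded residues, and a parenthesized number or range gives a repeat count. A minimal sketch of how such a pattern might be converted to a regular expression is given below. The function name is hypothetical, only the syntax elements just listed are handled, and this is not the profile search software referred to in the specification.

```python
import re

# One pattern element: x, a single residue, [allowed] or {excluded},
# optionally followed by a repeat count such as (2) or (1,3).
_ELEMENT = re.compile(r"(x|[A-Z]|\[[A-Z]+\]|\{[A-Z]+\})(?:\((\d+)(?:,(\d+))?\))?")


def pattern_to_regex(pattern):
    """Convert a PROSITE-style consensus pattern to a regular expression string."""
    parts = []
    for element in pattern.strip(". ").split("-"):
        match = _ELEMENT.fullmatch(element)
        if match is None:
            raise ValueError("unsupported pattern element: " + element)
        core, low, high = match.groups()
        if core == "x":
            core = "."                         # any residue
        elif core.startswith("{"):
            core = "[^" + core[1:-1] + "]"     # excluded residues
        if low:
            core += "{" + low + ("," + high if high else "") + "}"
        parts.append(core)
    return "".join(parts)


BZIP_PATTERN = "[KR]-x(1,3)-[RKSAQ]-N-x(2)-[SAQ](2)-x-[RKTAENQ]-x-R-x-[RK]"
print(pattern_to_regex(BZIP_PATTERN))
```

The resulting expression can be passed to `re.search` to scan a translated reading frame for the basic region signature.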

EF Hand (EFhand). SEQ ID NOs: 820, 1755 and 3285 correspond to
polynucleotides encoding a novel protein in the family of EF-hand proteins. Many
calcium-binding proteins belong to the same evolutionary family and share a type of
calcium-binding domain known as the EF-hand (Kawasaki et al., Protein Prof. (1995)
2:305-490). This type of domain consists of a twelve residue loop flanked on both sides by
a twelve residue alpha-helical domain. In an EF-hand loop the calcium ion is coordinated
in a pentagonal bipyramidal configuration. The six residues involved in the binding are in
positions 1, 3, 5, 7, 9 and 12; these residues are denoted by X, Y, Z, -Y, -X and -Z. The
invariant Glu or Asp at position 12 provides two oxygens for liganding Ca (bidentate
ligand). The consensus pattern includes the complete EF-hand loop as well as the first
residue which follows the loop and which seems to always be hydrophobic: D-x-[DNS]-
{ILVFYW}-[DENSTG]-[^
[LIVMFYW].
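The positional labels of the calcium-coordinating residues can be illustrated with a short sketch that reads positions 1, 3, 5, 7, 9 and 12 from a twelve residue loop. The function name is hypothetical and the example loop is illustrative only; it is not taken from the Sequence Listing.

```python
# Loop position (1-based) mapped to the coordinate label used in the text.
LIGAND_LABELS = {1: "X", 3: "Y", 5: "Z", 7: "-Y", 9: "-X", 12: "-Z"}


def ef_hand_ligands(loop):
    """Return the residue found at each calcium-coordinating position of a loop."""
    if len(loop) != 12:
        raise ValueError("an EF-hand loop has twelve residues")
    if loop[11] not in ("D", "E"):
        raise ValueError("position 12 is expected to be Glu (E) or Asp (D)")
    return {label: loop[position - 1] for position, label in LIGAND_LABELS.items()}


# Illustrative twelve residue loop.
print(ef_hand_ligands("DKDGDGTITTKE"))
```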

Ets Domain (Ets Nterm). SEQ ID NO: 1811 represents a polynucleotide
encoding a polypeptide with N-terminal homology to the ETS domain. Proteins of this family
contain a conserved domain, the "ETS-domain," that is involved in DNA binding. The
domain appears to recognize purine-rich sequences; it is about 85 to 90 amino acids in
length, and is rich in aromatic and positively charged residues (Wasylyk et al., Eur. J.
Biochem. (1993) 211:7-18). The ets gene family encodes a novel class of DNA-binding
proteins, each of which binds a specific DNA sequence and comprises an ets domain that
specifically interacts with sequences containing the common core tri-nucleotide sequence
GGA. In addition to an ets domain, native ets proteins comprise other sequences which can
modulate the biological specificity of the protein. Ets genes and proteins are involved in a
variety of essential biological processes including cell growth, differentiation and
development, and three members are implicated in oncogenic processes.

G-Protein Alpha Subunit (G-alpha). SEQ ID NO: 1846 represents a
polynucleotide encoding a novel polypeptide of the G-protein alpha subunit family.
Guanine nucleotide binding proteins (G-proteins) are a family of membrane-associated
proteins that couple extracellularly-activated integral-membrane receptors to intracellular
effectors, such as ion channels and enzymes that vary the concentration of second
messenger molecules. G-proteins are composed of 3 subunits (alpha, beta and gamma)
which, in the resting state, associate as a trimer at the inner face of the plasma membrane.
The alpha subunit binds GTP and exhibits GTPase activity. G-protein alpha subunits are
350-400 amino acids in length and have molecular weights in the range 40-45 kDa.
Seventeen distinct types of alpha subunit have been identified in mammals, and fall into 4
main groups on the basis of both sequence similarity and function: alpha-s, alpha-q, alpha-i
and alpha-12 (Simon et al., Science (1993) 252:802). They are often N-terminally acylated,
usually with myristate and/or palmitate, and these fatty acid modifications can be
important for membrane association and high-affinity interactions with other proteins.

Helicases conserved C-terminal domain (helicase C). SEQ ID NOs: 1496,
2826 and 2871 represent polynucleotides encoding novel members of the DEAD/H
helicase family. A number of eukaryotic and prokaryotic proteins have been characterized
(Schmid S.R., et al., Mol. Microbiol. (1992) 6:283; Linder P., et al., Nature (1989)
337:121; Wassarman D.A., et al., Nature (1991) 349:463) on the basis of their structural
similarity. All are involved in ATP-dependent, nucleic-acid unwinding. All DEAD box
family members of the above proteins share a number of conserved sequence motifs, some
of which are specific to the DEAD family while others are shared by other ATP-binding
proteins or by proteins belonging to the helicases 'superfamily' (Hodgman T.C., Nature
(1988) 333:22 and Nature (1988) 333:578 (Errata)). One of these motifs, called the
"D-E-A-D-box", represents a special version of the B motif of ATP-binding proteins. Some
other proteins belong to a subfamily which have His instead of the second Asp and are thus
said to be "D-E-A-H-box" proteins (Wassarman D.A., et al., Nature (1991) 349:463;
Harosh I., et al., Nucleic Acids Res. (1991) 19:6331; Koonin E.V. et al., J. Gen. Virol.
(1992) 73:989). The following signature patterns are used to identify members of both
subfamilies: 1) [LIVMF](2)-D-E-A-D-[RKEN]-x-[LIVMFYGSTN]; and 2) [GSAH]-x-
[LIVMF](3)-D-E-[ALIV]-H-[NECR].
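The two signature patterns can be applied to a candidate protein sequence to suggest which subfamily it may belong to. The sketch below transcribes the two patterns by hand into regular expressions; the function name and the example peptides are hypothetical, and a pattern match is only an indication of family membership, not a demonstration of it.

```python
import re

# Hand transcription of the two signature patterns given in the text.
DEAD_BOX = re.compile(r"[LIVMF]{2}DEAD[RKEN].[LIVMFYGSTN]")
DEAH_BOX = re.compile(r"[GSAH].[LIVMF]{3}DE[ALIV]H[NECR]")


def helicase_subfamily(protein):
    """Return "DEAD", "DEAH", or None according to which signature is found."""
    if DEAD_BOX.search(protein):
        return "DEAD"
    if DEAH_BOX.search(protein):
        return "DEAH"
    return None


# Invented peptides used only to exercise the two patterns.
print(helicase_subfamily("AAVLDEADRMLAA"))
print(helicase_subfamily("GAIVIDEAHEAA"))
print(helicase_subfamily("MKTAYIAK"))
```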

Homeobox domain (homeobox). SEQ ID NOs: 1676, 1820 and 1821
represent polynucleotides encoding proteins having a homeobox domain. The homeobox is
a protein domain of 60 amino acids (Gehring In: Guidebook to the Homeobox Genes,
Duboule D., Ed., pp. 1-10, Oxford University Press, Oxford, (1994); Buerglin In:
Guidebook to the Homeobox Genes, pp. 25-72, Oxford University Press, Oxford, (1994);
Gehring, Trends Biochem. Sci. (1992) 17:277-280; Gehring et al., Annu. Rev. Genet. (1986)
20:147-173; Schofield, Trends Neurosci. (1987) 10:3-6) first identified in a number of
Drosophila homeotic and segmentation proteins. It is extremely well conserved in many
other animals, including vertebrates. This domain binds DNA through a helix-turn-helix
type of structure. Several proteins that contain a homeobox domain play an important role
in development. Most of these proteins are sequence-specific DNA-binding transcription
factors. The homeobox domain is also very similar to a region of the yeast mating type
proteins. These are sequence-specific DNA-binding proteins that act as master switches in
yeast differentiation by controlling gene expression in a cell type-specific fashion.

A schematic representation of the homeobox domain is shown below. The 
helix-turn-helix region is shown by the symbols 'H' (for helix) and 't' (for turn).

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxHHHHHHHHtttHHHHHHHHHxxxxxxxxxx
1                                                         60

The pattern detects homeobox sequences 24 residues long and spans 
positions 34 to 57 of the homeobox domain. The consensus pattern is as follows:
[LIVMFYG]-[ASLVR]-x(2)-[LIVMST^^ 
[LIVFSTNKH]-W-[FYVC]-x-[NDQTAH]-x(5)-[RKNAIMW]. 

MAP kinase kinase (mkk). SEQ ID NOs:29, 31, 196, 3175, 3190 and 3281 
represent novel members of the MAP kinase kinase family. MAP kinases (MAPK) are 
involved in signal transduction, and are important in cell cycle and cell growth controls. 
The MAP kinase kinases (MAPKK) are dual-specificity protein kinases which 
phosphorylate and activate MAP kinases. MAPKK homologues have been found in yeast, 
invertebrates, amphibians, and mammals. Moreover, the MAPKK/MAPK phosphorylation 
switch constitutes a basic module activated in distinct pathways in yeast and in vertebrates. 
MAPKKs are essential transducers through which signals must pass before reaching the 
nucleus. For review, see, e.g., Biologique Biol Cell (1993) 79:193-207; Nishida et al., 
Trends Biochem Sci (1993) 18:128-31; Ruderman, Curr Opin Cell Biol (1993) 5:207-13; 
Dhanasekaran et al., Oncogene (1998) 17:1447-55; Kiefer et al., Biochem Soc Trans (1997) 
25:491-8; and Hill, Cell Signal (1996) 8:533-44.

Protein Kinase (protkinase). SEQ ID NOs:1157, 1478, 1496, 2286, 2969 
and 3190 represent polynucleotides encoding protein kinases. Protein kinases catalyze 
phosphorylation of proteins in a variety of pathways, and are implicated in cancer. 
Eukaryotic protein kinases (Hanks S.K., et al., FASEB J. (1995) 9:576; Hunter T., Meth. 
Enzymol. (1991) 200:3; Hanks S.K., et al., Meth. Enzymol. (1991) 200:38; Hanks S.K., 
Curr. Opin. Struct. Biol. (1991) 1:369; Hanks S.K. et al., Science (1988) 241:42) are 
enzymes that belong to a very extensive family of proteins which share a conserved 
catalytic core common to both serine/threonine and tyrosine protein kinases. There are a 
number of conserved regions in the catalytic domain of protein kinases. The first region, 
which is located in the N-terminal extremity of the catalytic domain, is a glycine-rich 
stretch of residues in the vicinity of a lysine residue, which has been shown to be involved 
in ATP binding. The second region, which is located in the central part of the catalytic 
domain, contains a conserved aspartic acid residue which is important for the catalytic 
activity of the enzyme (Knighton D.R. et al., Science (1991) 253:407). The protein kinase 
profile includes two signature patterns for this second region: one specific for 
serine/threonine kinases and the other for tyrosine kinases. A third profile is based on the 
alignment in Hanks S.K. et al., FASEB J. (1995) 9:576, and covers the entire catalytic 
domain.

The consensus patterns are as follows: 1) [LIV]-G-{P}-G-{P}-[FYWMGSTNH]-[SGA]-
{PW}-[LIVCAT]-{PD}-x-[GSTACLIVMFY]-x(5,18)-[LIVMFYWCSTAR]-[AIVP]-
[LIVMFAGCKR]-K, where K binds ATP; 2) [LIVMFYC]-x-[HY]-x-D-[LIVMFY]-K-x(2)-N-
[LIVMFYCT](3), where D is an active site residue; and 3) [LIVMFYC]-x-[HY]-x-D-
[LIVMFY]-[RSTAC]-x(2)-N-[LIVMFYC], where D is an active site residue.

If a protein analyzed includes two of the above protein kinase signatures, the 
probability of it being a protein kinase is close to 100%.
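
The two-signature rule stated above can be sketched as follows. This is a minimal illustration, not the profile search used to generate the table: the three regular expressions are hand transcriptions of the consensus patterns listed above (with excluded residues written as negated character classes and the ATP-binding pattern read with N in its sixth element), and the function names and the test sequence are hypothetical.

```python
import re

# Hand-transcribed regular expressions for the three consensus patterns above.
KINASE_SIGNATURES = {
    "ATP-binding region": re.compile(
        r"[LIV]G[^P]G[^P][FYWMGSTNH][SGA][^PW][LIVCAT][^PD]."
        r"[GSTACLIVMFY].{5,18}[LIVMFYWCSTAR][AIVP][LIVMFAGCKR]K"
    ),
    "Ser/Thr active site": re.compile(
        r"[LIVMFYC].[HY].D[LIVMFY]K.{2}N[LIVMFYCT]{3}"
    ),
    "Tyr active site": re.compile(
        r"[LIVMFYC].[HY].D[LIVMFY][RSTAC].{2}N[LIVMFYC]"
    ),
}

def kinase_signature_hits(protein: str):
    """Return the sorted names of the signatures found anywhere in the sequence."""
    return sorted(name for name, rx in KINASE_SIGNATURES.items() if rx.search(protein))

def looks_like_kinase(protein: str) -> bool:
    """Apply the rule of thumb from the text: at least two signatures present."""
    return len(kinase_signature_hits(protein)) >= 2

if __name__ == "__main__":
    # Made-up sequence containing a glycine-rich loop with its lysine and a
    # Ser/Thr-type catalytic loop; it is not one of the claimed sequences.
    demo = "MA" + "LGTGSFGRVMLVKHKETGNHYAMK" + "AAAA" + "LIYRDLKPENLLI" + "GG"
    print(kinase_signature_hits(demo), looks_like_kinase(demo))
```

A sequence carrying only one of the signatures is reported but not classified, which mirrors the hedged wording of the rule.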

Ras family proteins (ras). SEQ ID NOs:1688 and 3258 represent 
polynucleotides encoding novel members of the ras family of small GTP/GDP-binding 
proteins (Valencia et al., 1991, Biochemistry 30:4637-4648). Ras family members 
generally require a specific guanine nucleotide exchange factor (GEF) and a specific 
GTPase activating protein (GAP) as stimulators of overall GTPase activity. Among 
ras-related proteins, the highest degree of sequence conservation is found in four regions 
that are directly involved in guanine nucleotide binding. The first two constitute most of 
the phosphate and Mg2+ binding site (PM site) and are located in the first half of the 
G-domain. The other two regions are involved in guanosine binding and are located in the 
C-terminal half of the molecule. Motifs and conserved structural features of the ras-related 
proteins are described in Valencia et al., 1991, Biochemistry 30:4637-4648. A major 
consensus pattern of ras proteins is: D-T-A-G-Q-E-K-[LF]-G-G-L-R-[DE]-G-Y-Y.

Thioredoxin family active site (Thioredox). SEQ ID NO:1677 represents a 
polynucleotide encoding a protein having a thioredoxin family active site. Thioredoxins 
(Holmgren A., Annu. Rev. Biochem. (1985) 54:237; Gleason F.K. et al., FEMS Microbiol. 
Rev. (1988) 54:271; Holmgren A., J. Biol. Chem. (1989) 264:13963; Eklund H. et al., 
Proteins (1991) 11:13) are small proteins of approximately one hundred amino-acid 
residues which participate in various redox reactions via the reversible oxidation of an 
active center disulfide bond. They exist in either a reduced form or an oxidized form where 
the two cysteine residues are linked in an intramolecular disulfide bond. Thioredoxin is 
present in prokaryotes and eukaryotes and the sequence around the redox-active disulfide 
bond is well conserved. All protein disulfide isomerases (PDI) contain two or three 
(ERp72) copies of the thioredoxin domain. The consensus pattern is: 
[LIVMF]-[LIVMSTA]-x-[LIVMFYC]-[FYWSTHE]-x(2)-[FYWGTN]-C-[GATPLVE]-
[PHYWSTA]-C-x(6)-[LIVMFYWT] (where the two Cs form the redox-active bond).

Trypsin (trypsin). SEQ ID NO:1410 corresponds to a novel serine protease 
of the trypsin family. The catalytic activity of the serine proteases from the trypsin family 
is provided by a charge relay system involving an aspartic acid residue hydrogen-bonded to 
a histidine, which itself is hydrogen-bonded to a serine. The sequences in the vicinity of 
the active site serine and histidine residues are well conserved in this family of proteases 
(Brenner S., Nature (1988) 334:528). The consensus patterns for this trypsin protein family 
are: 1) [LIVM]-[ST]-A-[STAG]-H-C, where H is the active site residue; and 2) 
[DNSTAGC]-[GSTAPIMVQH]-x(2)-G-[DE]-S-G-[GS]-[SAPHV]-[LIVMFYWH]-
[LIVMFYSTANQH], where S is the active site residue. All sequences known to belong to 
this family are detected by the above consensus sequences, except for 18 different proteases 
which have lost the first conserved glycine. If a protein includes both the serine and the 
histidine active site signatures, the probability of it being a trypsin family serine protease is 
100%.

WD Domain, G-Beta Repeats (WD domain). SEQ ID NOs:1336, 1380, 
1711, 1762, 1909, 2218, 3047, 3108 and 3292 represent novel members of the WD 
domain/G-beta repeat family. Beta-transducin (G-beta) is one of the three subunits (alpha, 
beta, and gamma) of the guanine nucleotide-binding proteins (G proteins) which act as 
intermediaries in the transduction of signals generated by transmembrane receptors 
(Gilman, Annu. Rev. Biochem. (1987) 56:615). The alpha subunit binds to and hydrolyzes 
GTP; the functions of the beta and gamma subunits are less clear but they seem to be 
required for the replacement of GDP by GTP as well as for membrane anchoring and 
receptor recognition. In higher eukaryotes, G-beta exists as a small multigene family of 
highly conserved proteins of about 340 amino acid residues. Structurally, G-beta consists 
of eight tandem repeats of about 40 residues, each containing a central Trp-Asp motif (this 
type of repeat is sometimes called a WD-40 repeat). The consensus pattern for the WD 
domain/G-Beta repeat family is: [LIVMSTAC]-[LIVMFYWSTAGC]-[LIMSTAG]-
[LIVMSTAGC]-x(2)-[DN]-x(2)-[LIVMWSTAC]-x-[LIVMFSTAG]-W-[DEN]-
[LIVMFSTAGCN].

wnt Family of Developmental Signaling Proteins (Wnt dev sign). SEQ ID 
NO:1538 corresponds to a novel member of the wnt family of developmental signaling 
proteins. Wnt-1 (previously known as int-1), the seminal member of this family (Nusse 
R., Trends Genet. (1988) 4:291), is thought to play a role in intercellular communication 
and seems to be a signalling molecule important in the development of the central nervous 
system (CNS). All wnt family proteins share the following features characteristic of 
secretory proteins: a signal peptide, several potential N-glycosylation sites and 22 
conserved cysteines that are probably involved in disulfide bonds. The Wnt proteins seem 
to adhere to the plasma membrane of the secreting cells and are therefore likely to signal 
over only a few cell diameters. The consensus pattern, which is based upon a highly 
conserved region including three cysteines, is as follows: C-K-C-H-G-[LIVMT]-S-G-x-C.

Protein Tyrosine Phosphatase (Y phosphatase). SEQ ID NO:1417 
represents a polynucleotide encoding a protein tyrosine phosphatase. Tyrosine specific 
protein phosphatases (EC 3.1.3.48) (PTPase) (Fischer et al., Science (1991) 253:401; 
Charbonneau et al., Annu. Rev. Cell Biol. (1992) 8:463; Trowbridge, J. Biol. Chem. (1991) 
266:23517; Tonks et al., Trends Biochem. Sci. (1989) 14:497; and Hunter, Cell (1989) 
58:1013) catalyze the removal of a phosphate group attached to a tyrosine residue. These 
enzymes are very important in the control of cell growth, proliferation, differentiation and 
transformation. Multiple forms of PTPase have been characterized and can be classified 
into two categories: soluble PTPases and transmembrane receptor proteins that contain 
PTPase domain(s). Structurally, all known receptor PTPases are made up of a variable 
length extracellular domain, followed by a transmembrane region and a C-terminal 
catalytic cytoplasmic domain. PTPase domains consist of about 300 amino acids. The 
second of two conserved cysteines has been shown to be absolutely required for activity. 
Furthermore, a number of conserved residues in its immediate vicinity have also been 
shown to be important. The consensus pattern for PTPases is: [LIVMF]-H-C-x(2)-G-x(3)-
[STC]-[STAGP]-x-[LIVMFY]; C is the active site residue.

Zinc Finger, C2H2 Type (Zincfing C2H2). SEQ ID NOs:308, 807, 1324, 
1503, 1527, 3081, 3193 and 3306 correspond to polynucleotides encoding novel members 
of the C2H2 type zinc finger protein family. Zinc finger domains (Klug et al., Trends 
Biochem. Sci. (1987) 12:464; Evans et al., Cell (1988) 52:1; Payre et al., FEBS Lett. (1988) 
234:245; Miller et al., EMBO J. (1985) 4:1609; and Berg, Proc. Natl. Acad. Sci. USA 
(1988) 85:99) are nucleic acid-binding protein structures. In addition to the conserved zinc 
ligand residues, it has been shown that a number of other positions are also important for 
the structural integrity of the C2H2 zinc fingers (Rosenfeld et al., J. Biomol. Struct. Dyn. 
(1993) 11:557). The best conserved position is found four residues after the second 
cysteine; it is generally an aromatic or aliphatic residue. The consensus pattern for C2H2 
zinc fingers is: C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H. The two C's and two H's 
are zinc ligands.
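
As a worked illustration of the C2H2 consensus, the sketch below locates candidate fingers in a protein sequence and reports the positions of the four presumed zinc ligands. It assumes the pattern reads C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H; the function name zinc_ligands and the test peptide, a made-up finger-like sequence joined by an arbitrary linker, are hypothetical.

```python
import re

# The C2H2 consensus with the four zinc ligands captured as named groups.
C2H2 = re.compile(
    r"(?P<c1>C).{2,4}(?P<c2>C).{3}[LIVMFYWC].{8}(?P<h1>H).{3,5}(?P<h2>H)"
)

def zinc_ligands(protein: str):
    """Return one (Cys, Cys, His, His) tuple of 1-based positions per finger found."""
    fingers = []
    for m in C2H2.finditer(protein):
        fingers.append(tuple(m.start(g) + 1 for g in ("c1", "c2", "h1", "h2")))
    return fingers

if __name__ == "__main__":
    finger = "YKCPECGKSFSQKSNLQKHQRTH"   # illustrative only
    print(zinc_ligands(finger))
    print(zinc_ligands(finger + "TGEKP" + finger))
```

Because re.finditer reports non-overlapping matches, tandem fingers separated by a short linker are returned as separate tuples in sequence order.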

Src homology 2. SEQ ID NOs:186, 2591, 3307 and 3339 represent 
polynucleotides encoding novel members of the family of Src homology 2 (SH2) proteins. 
The Src homology 2 (SH2) domain is a protein domain of about 100 amino acid residues 
first identified as a conserved sequence region between the oncoproteins Src and Fps 
(Sadowski I. et al., Mol. Cell. Biol. 6:4396-4408 (1986)). Similar sequences are found in 
many other intracellular signal-transducing proteins (Russell R.B. et al., FEBS Lett. 
304:15-20 (1992)). SH2 domains function as regulatory modules of intracellular signalling 
cascades by interacting with high affinity with phosphotyrosine-containing target peptides 
in a sequence-specific and phosphorylation-dependent manner (Marengere L.E.M., Pawson 
T., J. Cell Sci. Suppl. 18:97-104 (1994); Pawson T., Schlessinger J., Curr. Biol. 3:434-442 
(1993); Mayer B.J., Baltimore D., Trends Cell Biol. 3:8-13 (1993); Pawson T., Nature 
373:573-580 (1995)).

The SH2 domain has a conserved 3D structure consisting of two alpha 
helices and six to seven beta-strands. The core of the domain is formed by a continuous 
beta-meander composed of two connected beta-sheets (Kuriyan J., Cowburn D., Curr. 
Opin. Struct. Biol. 3:828-837 (1993)). The profile to detect SH2 domains is based on a 
structural alignment consisting of 8 gap-free blocks and 7 linker regions totaling 92 match 
positions.

Src homology 3. SEQ ID NOs:234, 1832, and 1835 represent polynucleotides 
encoding novel members of the family of Src homology 3 (SH3) proteins. The Src 
homology 3 (SH3) domain is a small protein domain of about 60 amino acid residues first 
identified as a conserved sequence in the non-catalytic part of several cytoplasmic protein 
tyrosine kinases (e.g., Src, Abl, Lck) (Mayer B.J. et al., Nature 332:272-275 (1988)). Since 
then, it has been found in a great variety of other intracellular or membrane-associated 
proteins (Musacchio A. et al., FEBS Lett. 307:55-61 (1992); Pawson T., Schlessinger J., 
Curr. Biol. 3:434-442 (1993); Mayer B.J., Baltimore D., Trends Cell Biol. 3:8-13 (1993); 
Pawson T., Nature 373:573-580 (1995)).

The SH3 domain has a characteristic fold which consists of five or six beta 
strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may 
contain short helices (Kuriyan J., Cowburn D., Curr. Opin. Struct. Biol. 3:828-837 (1993)).

The function of the SH3 domain may be to mediate assembly of specific 
protein complexes via binding to proline-rich peptides (Morton C.J., Campbell I.D., Curr. 
Biol. 4:615-617 (1994)).

In general, SH3 domains are found as single copies in a given protein, but 
there are a significant number of proteins with two SH3 domains and a few with 3 or 4 
copies.

Fibronectin type III. SEQ ID NOs:746 and 1192 represent polynucleotides 
encoding novel members of the family of fibronectin type III proteins. A number of 
receptors for lymphokines, hematopoietic growth factors and growth hormone-related 
molecules have been found to share a common binding domain (Bazan J.F., Biochem. 
Biophys. Res. Commun. 164:788-795 (1989); Bazan J.F., Proc. Natl. Acad. Sci. USA 
87:6934-6938 (1990); Cosman D. et al., Trends Biochem. Sci. 15:265-270 (1990); d'Andrea 
A.D., Fasman G.D., Lodish H.F., Cell 58:1023-1024 (1989); d'Andrea A.D., Fasman G.D., 
Lodish H.F., Curr. Opin. Cell Biol. 2:648-651 (1990)).

The conserved region constitutes all or part of the extracellular ligand-binding 
region and is about 200 amino acid residues long. In the N-terminal part of this domain 
there are two pairs of cysteines known, in the growth hormone receptor, to be involved in 
disulfide bonds.

[Schematic representation: the extracellular region, containing four conserved cysteines 
(C) linked in two disulfide-bonded pairs, is followed by the transmembrane region 
(XXXXXXX) and the cytoplasmic region.]

Two patterns detect this family of receptors. The first one is derived from 
the first N-terminal disulfide loop; the second is a tryptophan-rich pattern located at the 
C-terminal extremity of the extracellular region.

A consensus for this protein family is: C-[LVFYR]-x(7,8)-[STIVDN]-C-x-W 
[The two C's are linked by a disulfide bond]. A second consensus for this protein family 
is: [STGL]-x-W-[SG]-x-W-S.

LIM domain containing proteins. SEQ ID NOs:1269, 1309, 1360, and 1386 
represent polynucleotides encoding novel members of the family of LIM domain 
containing proteins. A number of proteins contain a conserved cysteine-rich domain of 
about 60 amino-acid residues (Freyd G. et al., Nature 344:876-879 (1990); Baltz R. et al., 
Plant Cell 4:1465-1466 (1992); Sanchez-Garcia I., Rabbitts T.H., Trends Genet. 
10:315-320 (1994)).

In the LIM domain, there are seven conserved cysteine residues and a 
histidine. The arrangement followed by these conserved residues is C-x(2)-C-x(16,23)-H-
x(2)-[CH]-x(2)-C-x(2)-C-x(16,21)-C-x(2,3)-[CHD]. The LIM domain binds two zinc ions 
(Michelsen J.W. et al., Proc. Natl. Acad. Sci. U.S.A. 90:4404-4408 (1993)). LIM does not 
bind DNA; rather, it seems to act as an interface for protein-protein interaction. The 
consensus for this protein family is: C-x(2)-C-x(15,21)-[FYWH]-H-x(2)-[CH]-x(2)-C-x(2)-
C-x(3)-[LIVMF]. The 5 C's and the H bind zinc.

C2 domain (protein kinase C like). SEQ ID NOs:1325 and 2282 represent 
polynucleotides encoding novel members of the family of C2 domain containing proteins. 
Some isozymes of protein kinase C (PKC) contain a domain, known as C2, of about 116 
amino-acid residues, which is located between the two copies of the C1 domain (that bind 
phorbol esters and diacylglycerol) and the protein kinase catalytic domain (Azzi A. et al., 
Eur. J. Biochem. 208:547-557 (1992); Stabel S., Semin. Cancer Biol. 5:277-284 (1994)).

The C2 domain is involved in calcium-dependent phospholipid binding 
(Davletov B.A., Suedhof T.C., J. Biol. Chem. 268:26386-26390 (1993)). Since domains 
related to the C2 domain are also found in proteins that do not bind calcium, other putative 
functions for the C2 domain include binding to inositol-1,3,4,5-tetraphosphate (Fukuda M., 
et al., J. Biol. Chem. 269:29206-29211 (1994)).

The consensus pattern for the C2 domain is located in a conserved part of 
that domain, the connecting loop between beta strands 2 and 3. The profile for the C2 
domain covers the total domain. The consensus for this protein family is: [ACG]-x(2)-L-
x(2,3)-D-x(1,2)-[NGSTLIF]-[GTMR]-x-[STAP]-D-[PA]-[FY].

Serine proteases, trypsin family, active sites. SEQ ID NO:1410 represents a 
polynucleotide encoding a novel member of the trypsin family of serine proteases. 
The catalytic activity of the serine proteases from the trypsin family is provided by a charge 
relay system involving an aspartic acid residue hydrogen-bonded to a histidine, which itself 
is hydrogen-bonded to a serine. The sequences in the vicinity of the active site serine and 
histidine residues are well conserved in this family of proteases (Brenner S., Nature 
334:528-530 (1988)).

A consensus for this protein family is: [LIVM]-[ST]-A-[STAG]-H-C [H is 
the active site residue]. A second consensus for this protein family is: [DNSTAGC]-
[GSTAPIMVQH]-x(2)-G-[DE]-S-G-[GS]-[SAPHV]-[LIVMFYWH]-[LIVMFYSTANQH] 
[S is the active site residue].

RNA Recognition Motif Domain (RRM, RBD, or RNP). SEQ ID NOs: 
1464 and 1514 represent polynucleotides encoding novel members of the family of RNA 
recognition motif domain proteins (Bandziulis R.J. et al., Genes Dev. 3:431-437 (1989); 
Dreyfuss G. et al., Trends Biochem. Sci. 13:86-91 (1988)).

Inside the putative RNA-binding domain there are two regions which are 
highly conserved. The first one is a hydrophobic segment of six residues (which is called 
the RNP-2 motif); the second one is an octapeptide motif (which is called RNP-1 or 
RNP-CS). The position of both motifs in the domain is shown in the following schematic 
representation:

xxxxxxx######xxxxxxxxxxxxxxxxxxxxxxxxxxxxx########
       RNP-2                              RNP-1

As a consensus pattern for this type of domain the RNP-1 motif was used. 
The consensus for this protein family is: [RK]-G-{EDRKHPCG}-[AGSCI]-[FY]-[LIVA]-
x-[FYLM].

Phosphatidylinositol-specific phospholipase C, Y Domain. SEQ ID NO: 
1707 represents a polynucleotide encoding a novel member of the 
phosphatidylinositol-specific phospholipase C, Y domain family of proteins. 
Phosphatidylinositol-specific phospholipase C (EC 3.1.4.11), a eukaryotic intracellular 
enzyme, plays an important role in signal transduction processes (Meldrum E. et al., 
Biochim. Biophys. Acta 1092:49-71 (1991)). It catalyzes the hydrolysis of 
1-phosphatidyl-D-myo-inositol-3,4,5-triphosphate into the second messenger molecules 
diacylglycerol and inositol-1,4,5-triphosphate. This catalytic process is tightly regulated by 
reversible phosphorylation and binding of regulatory proteins (Rhee S.G., Choi K.D., Adv. 
Second Messenger Phosphoprotein Res. 25:35-61 (1992); Rhee S.G., Choi K.D., J. Biol. 
Chem. 267:12393-12396 (1992); Sternweis P.C., Smrcka A.V., Trends Biochem. Sci. 
17:502-506 (1992)).

All eukaryotic PI-PLCs contain two regions of homology, referred to as 
"X-box" and "Y-box". The order of these two regions is the same (NH2-X-Y-COOH), but 
the spacing is variable. In most isoforms, the distance between these two regions is only 
50-100 residues, but in the gamma isoforms one PH domain, two SH2 domains, and one 
SH3 domain are inserted between the two PLC-specific domains. The two conserved 
regions have been shown to be important for the catalytic activity. At the C-terminal of the 
Y-box, there is a C2 domain possibly involved in Ca-dependent membrane attachment.

Serine Carboxypeptidases. SEQ ID NO:1744 represents a polynucleotide 
encoding a novel member of the serine carboxypeptidases family of proteins. 
Carboxypeptidases may be either metallo carboxypeptidases or serine carboxypeptidases 
(EC 3.4.16.5 and EC 3.4.16.6). The catalytic activity of the serine carboxypeptidases, like 
that of the trypsin family serine proteases, is provided by a charge relay system involving 
an aspartic acid residue hydrogen-bonded to a histidine, which is itself hydrogen-bonded to 
a serine (Liao D.I., Remington S.J., J. Biol. Chem. 265:6528-6531 (1990)).

The sequences surrounding the active site serine and histidine residues are 
highly conserved in all these serine carboxypeptidases. A consensus for this protein family 
is: [LIVM]-x-[GTA]-E-S-Y-[AG]-[GS] [S is the active site residue]. A second consensus 
for this protein family is: [LIVF]-x(2)-[LIVSTA]-x-[IVPST]-x-[GSDNQL]-[SAGV]-[SG]-
H-x-[IVAQ]-P-x(3)-[PSA] [H is the active site residue].

dsrm Double-Stranded RNA Binding Motif. SEQ ID NO:1818 represents a 
polynucleotide encoding a novel member of the dsrm double-stranded RNA binding motif 
proteins. In eukaryotic cells, a multitude of RNA-binding proteins play key roles in the 
posttranscriptional regulation of gene expression. Characterization of these proteins has led 
to the identification of several RNA-binding motifs. Several human and other vertebrate 
genetic disorders are caused by aberrant expression of RNA-binding proteins (C.G. Burd 
& G. Dreyfuss, Science 265:615-621 (1994)).

Proteins containing double stranded RNA binding motifs bind to specific 
RNA targets. Double stranded RNA binding motifs are exemplified by interferon-induced 
protein kinase in humans, which is part of the cellular response to dsRNA.

SEQ ID NOs:2577, 3183 and 3195 encode members of the 4 transmembrane 
integral membrane protein family. This family consists of type III proteins, 
which are integral membrane proteins that contain an N-terminal membrane-anchoring 
domain that is not cleaved during biosynthesis, and which functions as a translocation 
signal and a membrane anchor. The proteins also have three additional transmembrane 
regions. The consensus pattern is: G-x(3)-[LIVMF]-x(2)-[GSA]-[LIVMF](2)-G-C-x-
[GA]-[STA]-x(2)-[EG]-x(2)-[CWN]-[LIVM](2).

SEQ ID NO:2944 encodes a polypeptide having a calpain large subunit, 
domain III. Calpains are a family of intracellular proteases that play a variety of biological 
roles. Calpain 3, also known as p94, is predominantly expressed in skeletal muscle and 
plays a role in limb-girdle muscular dystrophy type 2A (Sorimachi, H. et al., Biochem. J. 
328:721-732, 1997).

SEQ ID NOs:1911 and 1980 encode polypeptides having a C3HC4 type 
zinc finger domain (RING finger), which is a cysteine-rich domain of 40 to 60 residues that 
binds two atoms of zinc, and is believed to be involved in mediating protein-protein 
interactions. Mammalian proteins of this family include V(D)J recombination activating 
protein, which activates the rearrangement of immunoglobulin and T-cell receptor genes; 
breast cancer type 1 susceptibility protein (BRCA1); bmi-1 proto-oncogene; cbl 
proto-oncogene; and mel-18 protein, which is expressed in a variety of tumor cells and is a 
transcriptional repressor that recognizes and binds a specific DNA sequence. The 
consensus pattern is: C-x-H-x-[LIVMFY]-C-x(2)-C-[LIVMYA].

SEQ ID NO:3274 encodes a eukaryotic transcription factor with a fork head 
domain of about 100 amino acid residues. Proteins of this group are transcription factors, 
including mammalian transcription factors HNF-3-alpha, -beta, and -gamma; 
interleukin-enhancer binding factor; and HTLF, which binds to a region of human T-cell 
leukemia virus long terminal repeat. The consensus pattern is: [KR]-P-[PTQ]-[FYLVQH]-
S-[FY]-x(2)-[LIVM]-x(3,4)-[AC]-[LIM].

SEQ ID NO:3345 encodes a polypeptide having a PDZ domain. Several 
dozen signaling proteins belong to this group of proteins that have 80-100 residue repeats 
known as PDZ domains. Several of the proteins interact with the C-terminal tetrapeptide 
motifs X-Ser/Thr-X-Val-COO- of ion channels and/or receptors (Ponting, C. P., Protein 
Sci. 6:464-468, 1997).
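
The C-terminal tetrapeptide motif recognized by PDZ domains can be checked with a few lines of code. The sketch below assumes the motif reads X-Ser/Thr-X-Val at the free carboxyl terminus of the candidate binding partner; the function name has_pdz_binding_tail and the test peptides are hypothetical.

```python
def has_pdz_binding_tail(protein: str) -> bool:
    """True if the last four residues fit X-[S/T]-X-V at the C-terminus."""
    tail = protein[-4:]
    # Position 2 of the tetrapeptide must be Ser or Thr, position 4 must be Val.
    return len(tail) == 4 and tail[1] in "ST" and tail[3] == "V"

if __name__ == "__main__":
    # Made-up peptides used only to exercise the check.
    for peptide in ("MKLAETDV", "MKLAETDL", "MKLAEADV"):
        print(peptide, has_pdz_binding_tail(peptide))
```

The check is anchored at the end of the sequence because the motif is defined relative to the terminal carboxylate rather than to any internal position.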

SEQ ID NO:3351 encodes a polypeptide in the family of phorbol 
esters/diacylglycerol binding proteins. Phorbol esters (PE) are analogues of diacylglycerol 
(DAG) and potent tumor promoters. DAG activates a family of serine-threonine protein 
kinases, known as protein kinase C. The N-terminal region of protein kinase C binds PE 
and DAG, and contains one or two copies of a cysteine-rich domain of about 50 amino acid 
residues. Other proteins having this domain include diacylglycerol kinase; the vav 
oncogene; and N-chimaerin, a brain-specific protein. The DAG/PE binding domain binds 
two zinc ions through the six cysteines and two histidines that are conserved in the domain. 
The consensus pattern is: H-x-[LIVMFYW]-x(8,11)-C-x(2)-C-x(3)-[LIVMFC]-x(5,10)-
C-x(2)-C-x(4)-[HD]-x(2)-C-x(5,9)-C.

SEQ ID NO:2216 encodes a polypeptide having a WW/rsp5/WWP domain. 
The domain is named for the presence of conserved aromatic positions, generally 
tryptophan, as well as a conserved proline. Proteins having the domain include dystrophin, 
vertebrate YAP protein, and IQGAP, a human GTPase activating protein which acts on ras. 
The consensus pattern is: W-x(9,11)-[VFY]-[FYW]-x(6,7)-[GSTNE]-[GSTQCR]-[FYW]-
x(2)-P.

SEQ ID NO:2428 encodes a member of the dual specificity phosphatase 
family, having a catalytic domain, and SEQ ID NOs:2281 and 2310 encode members of 
the protein tyrosine phosphatase family. These families are related and classified as 
tyrosine specific protein phosphatases. The enzymes catalyze the removal of a phosphate 
group from a tyrosine residue, and are important in the control of cell growth, proliferation, 
differentiation, and transformation. The consensus pattern is: [LIVMF]-H-C-x(2)-G-x(3)-
[STC]-[STAGP]-x-[LIVMFY].

Table 5



SEQ ID 


Start 


Stop 


Score 


Direction 


Description 


29 


295 


421 


5872 


For 


mkk like kinases 


30 


31 


182 


3943 


For 


Basic region plus leucine zipper 
transcription factors 


31 


298 


397 


5625 


For 


mkk like kinases 


186 


175 


395 


7660 


For 


SH2 Domain 


187 


358 


432 


4320 


For 


Ank repeat 




37 


322 


6049 


For 


mkk like kinases 


234 


23 


121 


4607 


For 


SH3 Domain 


308 


110 


172 


4150 


For 


Zinc finger, C2H2 type 


410 


42 


191 


4036 


For 


Basic region plus leucine zipper 
transcription tactors 


431 


71 


428 


5538 


Rev 


ATPases Associated with Various 
Cellular Activities 


552 


1 16 


288 


3930 


Rev 


Basic region plus leucine zipper 
transcription factors 




1 57 


561 


5797 


For 


ATPases Associated with Various 
Cellular Activities 


746 


209 


427 


5379 


For 


Fibronectin type III domain 


768 


116 


288 


3930 


For 


Basic region plus leucine zipper 
transcription factors 


807 


339 


392 


3620 


For 


Zinc finger, C2H2 type 


820 


341 


406 


2930 


Rev 


EF-hand 


822 

UAL 


108 


262 


4179 


For 


Basic region plus leucine zipper 
transcription factors 


836 


158 


353 


4430 


For 


Basic region plus leucine zipper 
transcription factors 


1157 


41 


444 


5279 


Rev 


protein kinase 


1192 


186 


416 


5469 


For 


Fibronectin type III domain 


1268 


238 


315 


3540 


For 


Ank repeat 


1269 


79 


240 


11640 


For 


LIM domain containing proteins 
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OLoTl 




OwlC 


Dirprti'nn 


np^crintion 


lZoo 


I'X 
I J 






Fnr 


Rasie repion nlus leucine zinoer 
transcription factors 


1309 


248 


404 


8226 


for 


LIM domain containing proteins 


1324 


294 


356 


4690 


for 


Zinc finger, C2H2 type 


1325 


l 


234 


8981 


for 


C2 domain (prot. kinase C like) 


i J JO 




1 Or 


6190 


for 


WD domain G-beta reneats 


1 ^£0 
1 JOU 


111 


^77 
j / / 




for 


T TM domain containing nroteins 




O" 


9^7 


»/ZZ> 1 


for 


Rasic region nlus leucine zinner 
transcription factors 


1380 


42 


140 


7130 


for 


WD domain, G-beta repeats 


1386 


243 


398 


8736 


for 


LIM domain containing proteins 




zzz 




1 UJJJ 


for 

L\Jl 


TYvrwin 
1 1 y uoiii 


1417 


8 


354 


6073 


for 


Protein Tyrosine Phosphatase 


1454 


49 


209 


3996 


for 


Basic region plus leucine zipper 

tranQrrirvfinn fartnrs 


1404 


A 


1 OU 


4Q78 

4/ / O 


fnr 


RNA rerocrnition motif faka RRM 

RBD, or RNP domain) 


1478 


54 


437 


5176 


for 


protein kinase 


1496 


241 


520 


3929 


for 


Helicases conserved C-terminal domain 


1 /IDA 
14V0 


AO 




S1 87 


for 


nrntf*in kinase 




1 34 




4870 


for 


7inc finder C2H2 tvne 


1514 


2 


252 


4662 


for 


RNA recognition motif, (aka RRM, 
RRD or RNP domain"! 


1527 


156 


212 


3520 


for 


Zinc finger, C2H2 type 


1538 


9 


635 


11087 


for 


wnt family of developmental signaling 
proteins 


1540 


289 


471 


4107 


for 


Basic region plus leucine zipper 
transcription factors 


1549 


200 


391 


4118 


for 


Basic region plus leucine zipper 
transcription factors 


1 r c f 

1556 


163 


1 C A 

354 


395o 


lor 


Basic region plus leucine zipper 
transcription factors 


1557 


207 


398 


4038 


for 


Basic region plus leucine zipper 
transcription factors 
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SEQ ID 


Start 


Stop 


Score 


Direction 


Description 


1563 


107 


298 


3978 


for 


Basic region plus leucine zipper 
transcription factors 


1622 


180 


365 


4022 


for 


Basic region plus leucine zipper 

tmnQPrintinn fflptnr^ 

llCUloVl lUUVJli ItlvLvlO 


1 

LOjv 


1 fin 




jyyo 


fnr 

IK/I 


transcription factors 


1674 


196 


258 


4880 


for 


Zinc finger, C2H2 type 


1676 


9 


86 


6610 


for 


Homeobox Domain 


lo/ / 


j 10 


3 AO 


J 1 6U 


rev 


TVi i n r*=»rl n y i n c 
1 IllUICUUAillo 


1688 


109 


410 


17414 


for 


Ras family 


1704 


184 


372 


3977 


for 


Basic region plus leucine zipper 

tt*anc***riTVt"ir\?i Tar*triT*c 
iralloWripHUil laCLUIo 


1 ^m 






ZH- 1 V\J 


rev 


I 11U jUHcHIU-Y IHlVJollUl oUCL'lllC 

phospholipase C, Y domain 


1711 


263 


361 


6400 


for 


WD domain, G-beta repeats 


1744 


238 


433 


10572 


rev 


Serine carboxypeptidases 


1755 


281 


367 


2580 


for 


EF-hand 


1762 


236 


334 


5880 


for 


WD domain, G-beta repeats 


1779 


64 


126 


4790 


for 


Zinc finger, C2H2 type 


1 5U1 






Afi^fi 
HUjU 




Zinc finger, C2H2 type


1 OA/1 
lOU't 




j to 




for


Ank repeat


1808 


36 


161 


4170 


for 


Basic region plus leucine zipper 
transcription factors 


1811 


184 


315 


8390 


for 


N-terminal homology in Ets domain 


1814 


127 


294 


10770 


for 


Bromodomain (conserved sequence
found in human, Drosophila and yeast
proteins.)


1818 


9 


146 


4741 


for 


Double-stranded RNA binding motif 


1819 


278 


355 


3460 


for 


Ank repeat 


1820 


123 


299 


12150 


for 


Homeobox Domain 


1821 


127 


303 


12180 


for 


Homeobox Domain 


1830 


184 


267 


4270 


for 


Ank repeat 


1832 


18 


173 


8987 


for 


SH3 Domain 


1835 


51 


206 


8987 


for 


SH3 Domain 
SEQ ID | Start | Stop | Score | Direction | Description
1839 | 224 | 307 | 4270 | for | Ank repeat
1846 | 12 | 398 | 36700 | for | G-protein alpha subunit
1909 | 160 | 258 | 6370 | for | WD domain, G-beta repeats
1911 | 35 | 151 | 9335 | for | Zinc finger, C3HC4 type (RING finger)
1980 | 60 | 197 | 7917 | for | Zinc finger, C3HC4 type (RING finger)
2065 | 253 | 306 | 5410 | for | Zinc finger, CCHC class
2135 | 2 | 401 | 10596 | for | ATPases Associated with Various Cellular Activities
2216 | 90 | 179 | 5380 | for | WW/rsp5/WWP domain containing proteins
2218 | 127 | 225 | 5500 | for | WD domain, G-beta repeats
2281 | 20 | 387 | 6044 | for | Protein Tyrosine Phosphatase
2282 | 183 | 353 | 5136 | for | C2 domain (prot. kinase C like)
2286 | 12 | 382 | 5228 | for | protein kinase
2310 | 20 | 371 | 5962 | for | Protein Tyrosine Phosphatase
2363 | 48 | 211 | 4132 | for | Basic region plus leucine zipper transcription factors
2424 | 43 | 194 | 3996 | for | Basic region plus leucine zipper transcription factors
9428 | 25 | 350 | 4675 | for | Dual specificity phosphatase, catalytic domain
2562 | 18 | 101 | 4560 | for | Ank repeat
2577 | 0 | 311 | 10295 | for | 4 transmembrane segments integral membrane proteins
2591 | 60 | 165 | 4560 | for | SH2 Domain
2684 | 9 | 461 | 5759 | for | ATPases Associated with Various Cellular Activities
2826 | 116 | 400 | 16107 | for | DEAD and DEAH box helicases
2859 | 100 | 320 | 5550 | rev | ATPases Associated with Various Cellular Activities
2871 | 198 | 392 | 9384 | for | DEAD and DEAH box helicases
2944 | 18 | 281 | 10480 | for | Calpain large subunit, domain III
2969 | 5 | 387 | 5976 | rev | protein kinase
3015 | 131 | 214 | 3600 | for | Ank repeat
3047 | 191 | 292 | 5295 | for | WD domain, G-beta repeats
3081 | 190 | 252 | 4360 | for | Zinc finger, C2H2 type
3108 | 275 | 367 | 5791 | for | WD domain, G-beta repeats
3147 | 190 | 369 | 4022 | for | Basic region plus leucine zipper transcription factors
3152 | 129 | 320 | 3947 | for | Basic region plus leucine zipper transcription factors
3158 | 167 | 334 | 4180 | for | Basic region plus leucine zipper transcription factors
3175 | 14 | 164 | 5951 | for | mkk like kinases
3175 | 8 | 112 | 5968 | for | protein kinase
3178 | 45 | 386 | 19398 | for | ATPases Associated with Various Cellular Activities
3183 | 14 | 215 | 9133 | for | 4 transmembrane segments integral membrane proteins
3190 | 229 | 390 | 6089 | for | mkk like kinases
3190 | 118 | 390 | 8063 | for | protein kinase
3193 | 293 | 355 | 3570 | for | Zinc finger, C2H2 type
3195 | 0 | 215 | 10146 | for | 4 transmembrane segments integral membrane proteins
3197 | 281 | 343 | 4490 | for | Zinc finger, C2H2 type
3208 | 34 | 256 | 4190 | for | Basic region plus leucine zipper transcription factors
3258 | 138 | 394 | 9877 | for | Ras family
3266 | 8 | 139 | 9328 | for | ATPases Associated with Various Cellular Activities
3267 | 97 | 180 | 3820 | for | Ank repeat
3274 | 11 | 187 | 15442 | for | Fork head domain, eukaryotic transcription factors
3281 | 15 | 182 | 9681 | for | mkk like kinases
3285 | 16 | 102 | 4680 | for | EF-hand
3292 | 208 | 300 | 5585 | for | WD domain, G-beta repeats
3297 | 7 | 153 | 6100 | for | Helicases conserved C-terminal domain
3306 | 161 | 223 | 4900 | for | Zinc finger, C2H2 type
3307 | 43 | 321 | 8740 | for | SH2 Domain
3339 | 94 | 342 | 14970 | for | SH2 Domain
3345 | 65 | 271 | 12512 | for | PDZ domain
3351 | 124 | 270 | 6068 | for | Phorbol esters/diacylglycerol binding

Example 4

Differential Expression of Polynucleotides of the Invention:
Description of Libraries and Detection of Differential Expression

The relative expression levels of the polynucleotides of the invention were
assessed in several libraries prepared from various sources, including cell lines and patient
tissue samples. Table 6 provides a summary of these libraries, including the shortened
library name (used hereafter), the mRNA source used to prepare the cDNA library, the
abbreviated name of the library that is used in the tables below (in quotes), and the
approximate number of clones in the library.



Table 6

Description of cDNA Libraries

Library (lib #) | Description | Number of Clones in this Clustering
1 | KM12L4; Human Colon Cell Line, High Metastatic Potential (derived from KM12C); "High Colon" | 307133
2 | KM12C; Human Colon Cell Line, Low Metastatic Potential; "Low Colon" | 284755
3 | MDA-MB-231; Human Breast Cancer Cell Line, High Metastatic Potential; micro-metastases in lung; "High Breast" | 326937
4 | MCF7; Human Breast Cancer Cell, Non Metastatic; "Low Breast" | 318979
8 | MV-522; Human Lung Cancer Cell Line, High Metastatic Potential; "High Lung" | 223620
9 | UCP-3; Human Lung Cancer Cell Line, Low Metastatic Potential; "Low Lung" | 312503
12 | Human microvascular endothelial cells (HMEC) - Untreated; PCR (OligodT) cDNA library | 41938
13 | Human microvascular endothelial cells (HMEC) - Basic fibroblast growth factor (bFGF) treated; PCR (OligodT) cDNA library | 42100
14 | Human microvascular endothelial cells (HMEC) - Vascular endothelial growth factor (VEGF) treated; PCR (OligodT) cDNA library | 42825
15 | Normal Colon - UC#2 Patient; PCR (OligodT) cDNA library; "Normal Colon Tumor Tissue" | 34285
16 | Colon Tumor - UC#2 Patient; PCR (OligodT) cDNA library; "Normal Colon Tumor Tissue" | 35625
17 | Liver Metastasis from Colon Tumor of UC#2 Patient; PCR (OligodT) cDNA library; "High Colon Metastasis Tissue" | 36984
18 | Normal Colon - UC#3 Patient; PCR (OligodT) cDNA library; "Normal Colon Tumor Tissue" | 36216
19 | Colon Tumor - UC#3 Patient; PCR (OligodT) cDNA library; "High Colon Tumor Tissue" | 41388
20 | Liver Metastasis from Colon Tumor of UC#3 Patient; PCR (OligodT) cDNA library; "High Colon Metastasis Tissue" | 30956
21 | GRRpz; Human Prostate Cell Line | 164801
22 | WOca; Human Prostate Cancer Cell Line | 162088


The KM12L4 and KM12C cell lines are described in Example 1 above. The
MDA-MB-231 cell line was originally isolated from pleural effusions (Cailleau, J. Natl.
Cancer Inst. (1974) 53:661), is of high metastatic potential, and forms poorly
differentiated adenocarcinoma grade II in nude mice consistent with breast carcinoma. The
MCF7 cell line was derived from a pleural effusion of a breast adenocarcinoma and is
non-metastatic. The MV-522 cell line is derived from a human lung carcinoma and is of high
metastatic potential. The UCP-3 cell line is a low metastatic human lung carcinoma cell
line; the MV-522 is a high metastatic variant of UCP-3. These cell lines are
well-recognized in the art as models for the study of human breast and lung cancer (see, e.g.,
Chandrasekaran et al., Cancer Res. (1979) 39:870 (MDA-MB-231 and MCF-7); Gastpar et
al., J Med Chem (1998) 41:4965 (MDA-MB-231 and MCF-7); Ranson et al., Br J Cancer
(1998) 77:1586 (MDA-MB-231 and MCF-7); Kuang et al., Nucleic Acids Res (1998)
26:1116 (MDA-MB-231 and MCF-7); Varki et al., Int J Cancer (1987) 40:46 (UCP-3);
Varki et al., Tumour Biol (1990) 11:327 (MV-522 and UCP-3); Varki et al., Anticancer
Res. (1990) 10:637 (MV-522); Kelner et al., Anticancer Res (1995) 15:867 (MV-522); and
Zhang et al., Anticancer Drugs (1997) 8:696 (MV522)). The samples of libraries 15-20 are
derived from two different patients (UC#2 and UC#3). The bFGF-treated HMEC were
prepared by incubation with bFGF at 10 ng/ml for 2 hrs; the VEGF-treated HMEC were
prepared by incubation with 20 ng/ml VEGF for 2 hrs. Following incubation with the
respective growth factor, the cells were washed and lysis buffer added for RNA
preparation. The GRRpz cell line refers to low passage (3 passages or fewer) human
prostate cells, and the WOca cell line refers to low passage (3 passages or fewer) human
prostate cancer cells.

Each of the libraries is composed of a collection of cDNA clones that in turn
are representative of the mRNAs expressed in the indicated mRNA source. In order to
facilitate the analysis of the millions of sequences in each library, the sequences were
assigned to clusters. The concept of "cluster of clones" is derived from a sorting/grouping
of cDNA clones based on their hybridization pattern to a panel of roughly 300 7-bp
oligonucleotide probes (see Drmanac et al., Genomics (1996) 37(1):29). Random cDNA
clones from a tissue library are hybridized at moderate stringency to 300 7-bp
oligonucleotides. Each oligonucleotide has some measure of specific hybridization to that
specific clone. The combination of these 300 measures of hybridization, one for each of
the 300 probes, constitutes the "hybridization signature" for a specific clone. Clones with
similar sequence will have similar hybridization signatures. By developing a sorting/grouping
algorithm to analyze these signatures, groups of clones in a library can be identified and
brought together computationally. These groups of clones are termed "clusters". Depending on the
stringency of the selection in the algorithm (similar to the stringency of hybridization in a
classic library cDNA screening protocol), the "purity" of each cluster can be controlled.
For example, artifacts of clustering may occur in computational clustering just as artifacts
can occur in "wet-lab" screening of a cDNA library with 400 bp cDNA fragments, at even
the highest stringency. The stringency used in the implementation of clustering herein
provides groups of clones that are in general from the same cDNA or closely related
cDNAs. Closely related clones can be a result of different length clones of the same
cDNA, closely related clones from highly related gene families, or splice variants of the
same cDNA.
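
By way of illustration only, the grouping of clones by hybridization signature described above can be sketched as follows. The specification does not state which sorting/grouping algorithm was used; the greedy single-pass grouping, the Pearson correlation measure, the threshold value, and the names `pearson` and `cluster_clones` are all assumptions made for this sketch, with the threshold standing in for the "stringency" of the selection.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation between two equal-length signatures."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a flat signature carries no pattern to compare
    return cov / sqrt(var_a * var_b)


def cluster_clones(signatures, threshold=0.9):
    """Greedy grouping (hypothetical): a clone joins the first cluster whose
    representative (its first member) correlates at or above `threshold`;
    otherwise it starts a new cluster. A higher threshold gives "purer" clusters."""
    clusters = []
    for clone_id, signature in signatures.items():
        for members in clusters:
            if pearson(signature, signatures[members[0]]) >= threshold:
                members.append(clone_id)
                break
        else:
            clusters.append([clone_id])
    return clusters


# Toy signatures with 4 probes instead of roughly 300.
toy = {
    "cloneA": [1.0, 2.0, 3.0, 4.0],
    "cloneB": [1.1, 2.0, 3.2, 3.9],
    "cloneC": [4.0, 3.0, 2.0, 1.0],
}
print(cluster_clones(toy))
```

In this toy input, cloneA and cloneB have nearly parallel signatures and are grouped together, while cloneC has the opposite pattern and forms its own cluster.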

Differential expression for a selected cluster was assessed by first
determining the number of cDNA clones corresponding to the selected cluster in the first
library (Clones in 1st), and then determining the number of cDNA clones corresponding to
the selected cluster in the second library (Clones in 2nd). Differential expression of the
selected cluster in the first library relative to the second library is expressed as a "ratio" of
percent expression between the two libraries. In general, the "ratio" is calculated by: 1)
calculating the percent expression of the selected cluster in the first library by dividing the
number of clones corresponding to a selected cluster in the first library by the total number
of clones analyzed from the first library; 2) calculating the percent expression of the
selected cluster in the second library by dividing the number of clones corresponding to a
selected cluster in a second library by the total number of clones analyzed from the second
library; 3) dividing the calculated percent expression from the first library by the calculated
percent expression from the second library. If the "number of clones" corresponding to a
selected cluster in a library is zero, the value is set at 1 to aid in calculation. The formula
used in calculating the ratio takes into account the "depth" of each of the libraries being
compared, i.e., the total number of clones analyzed in each library.
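
The three-step ratio calculation above can be sketched as follows. The function name `expression_ratio` is hypothetical, and the rounding to the nearest whole number in the usage lines is an assumption made here for comparison with the tabulated values; the library totals are those listed in Table 6.

```python
def expression_ratio(clones_first, total_first, clones_second, total_second):
    """Ratio of percent expression in the first library to the second library.

    A clone count of zero is set to 1 to aid in calculation, as described
    in the text above.
    """
    clones_first = max(clones_first, 1)
    clones_second = max(clones_second, 1)
    percent_first = clones_first / total_first      # step 1
    percent_second = clones_second / total_second   # step 2
    return percent_first / percent_second           # step 3


# Library depths from Table 6: lib3 = 326937 clones, lib4 = 318979 clones.
ratio = expression_ratio(64, 326937, 0, 318979)
print(round(ratio))  # 64 clones in lib3 and none in lib4 gives a ratio of about 62
```

Because each count is divided by its own library total before the two are compared, a cluster with the same raw count in two libraries of different depth does not yield a ratio of exactly 1.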

In general, a polynucleotide is said to be significantly differentially
expressed between two samples when the ratio value is greater than at least about 2,
preferably greater than at least about 3, more preferably greater than at least about 5, where
the ratio value is calculated using the method described above. The significance of
differential expression is determined using a z score test (Zar, Biostatistical Analysis,
Prentice Hall, Inc., USA, "Differences between Proportions," pp 296-298 (1974)).
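
A z score test for the difference between two proportions can be sketched as follows. This is a minimal sketch of the standard pooled two-proportion z test; whether the cited reference applies a continuity correction or other refinement is not stated in the text, so the uncorrected form and the name `two_proportion_z` are assumptions.

```python
from math import erfc, sqrt


def two_proportion_z(clones_1, total_1, clones_2, total_2):
    """Pooled two-proportion z statistic and two-sided p-value (no continuity correction)."""
    if clones_1 + clones_2 == 0:
        raise ValueError("at least one library must contain the cluster")
    p1 = clones_1 / total_1
    p2 = clones_2 / total_2
    pooled = (clones_1 + clones_2) / (total_1 + total_2)
    std_err = sqrt(pooled * (1 - pooled) * (1 / total_1 + 1 / total_2))
    z = (p1 - p2) / std_err
    p_value = erfc(abs(z) / sqrt(2))  # two-sided tail of the standard normal
    return z, p_value


# 64 clones out of 326937 (lib3) versus 0 clones out of 318979 (lib4).
z, p = two_proportion_z(64, 326937, 0, 318979)
print(f"z = {z:.2f}, p = {p:.2e}")
```

Unlike the ratio calculation, this sketch uses the raw clone counts, including zero; substituting 1 for a zero count here would be a further assumption.
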
EXAMPLE 5

Polynucleotides Differentially Expressed in High Metastatic Potential
Breast Cancer Cells Versus Low Metastatic Breast Cancer Cells

A number of polynucleotide sequences have been identified that are
differentially expressed between cells derived from high metastatic potential breast cancer
tissue and low metastatic breast cancer cells. Expression of these sequences in breast
cancer can be valuable in determining diagnostic, prognostic and/or treatment information.
For example, sequences that are highly expressed in the high metastatic potential cells can
be indicative of increased expression of genes or regulatory sequences involved in the
metastatic process. A patient sample displaying an increased level of one or more of these
polynucleotides may thus warrant more aggressive treatment. In another example,
sequences that display higher expression in the low metastatic potential cells can be
associated with genes or regulatory sequences that inhibit metastasis, and thus the
expression of these polynucleotides in a sample may warrant a more positive prognosis
than the gross pathology would suggest.

The differential expression of these polynucleotides can be used as a
diagnostic marker, a prognostic marker, for risk assessment, patient treatment and the like.
These polynucleotide sequences can also be used in combination with other known
molecular and/or biochemical markers.

The following tables summarize polynucleotides that are differentially
expressed between high metastatic potential breast cancer cells and low metastatic potential
breast cancer cells.

Table 7

Differentially expressed polynucleotides: Higher expression in
high metastatic potential breast cancer (lib3) relative to low metastatic
breast cancer cells (lib4)

SEQ ID NOs: | Lib3 clones | Lib4 clones | lib3/lib4
472 | 64 | 0 | 62
1851 | 6 | 0 | 6
1856 | 8 | 0 | 8
1867 | 6 | 0 | 6
1872 | 6 | 0 | 6
1875 | 12 | 3 | 4
1923 | 89 | 22 | 4
2118 | 7 | 0 | 7
2119 | 7 | 0 | 7
2135 | 37 | 13 | 3
2190 | 19 | 0 | 19
2193 | 16 | 5 | 3
2232 | 12 | 2 | 6
2239 | 6 | 0 | 6
2338 | 21 | 2 | 10
2378 | 16 | 4 | 4
2394 | 6 | 0 | 6
2395 | 6 | 0 | 6
2490 | 13 | 3 | 4
2505 | 16 | 2 | 8
2540 | 8 | 1 | 8
2542 | 11 | 1 | 11
2607 | 11 | 2 | 5
2640 | 22 | 5 | 4
2674 | 8 | 0 | 8
2679 | 19 | 0 | 19
2684 | 14 | 4 | 3
2707 | 8 | 0 | 8
2724 | 9 | 0 | 9
2757 | 6 | 0 | 6
2776 | 10 | 0 | 10
2804 | 13 | 2 | 6
2818 | 6 | 0 | 6
2906 | 14 | 0 | 14
2959 | 26 | 8 | 3
2964 | 17 | 4 | 4
2968 | 6 | 0 | 6
2977 | 22 | 3 | 7
2980 | 13 | 1 | 13
3010 | 6 | 0 | 6
3043 | 10 | 1 | 10
3071 | 33 | 12 | 3
3072 | 9 | 1 | 9
3095 | 19 | 3 | 6
3097 | 11 | 2 | 5
3173 | 12 | 2 | 6
3203 | 8 | 1 | 8
3210 | 27 | 8 | 3
3212 | 13 | 1 | 13
3284 | 8 | 0 | 8
3288 | 6 | 0 | 6
3331 | 14 | 3 | 5
3335 | 13 | 1 | 13



Table 8

Differentially expressed polynucleotides: Higher expression in
low metastatic breast cancer cells (lib4) relative to high metastatic
potential breast cancer (lib3)

SEQ ID NOs: | Lib3 clones | Lib4 clones | lib4/lib3
402 | 0 | 6 | 6
614 | 3 | 21 | 7
624 | 0 | 6 | 6
626 | 0 | 8 | 8
712 | 0 | 9 | 9
744 | 0 | 7 | 7
1325 | 2 | 29 | 15
1452 | 2 | 13 | 7
1880 | 0 | 9 | 9
1915 | 0 | 7 | 7
1951 | 0 | 6 | 6
1955 | 8 | 32 | 4
2015 | 0 | 7 | 7
2046 | 0 | 7 | 7
2076 | 1 | 22 | 23
2087 | 0 | 6 | 6
2124 | 0 | 9 | 9
2145 | 0 | 8 | 8
2162 | 0 | 6 | 6
2163 | 0 | 12 | 12
2164 | 5 | 19 | 4
2172 | 2 | 15 | 8
2192 | 5 | 16 | 3
2244 | 20 | 43 | 2
2266 | 3 | 18 | 6
2313 | 24 | 56 | 2
2346 | 1 | 13 | 13
2355 | 0 | 10 | 10
2371 | 0 | 6 | 6
2393 | 1 | 17 | 17
2404 | 1 | 21 | 22
2443 | 0 | 6 | 6
2460 | 0 | 11 | 11
2523 | 0 | 6 | 6
2575 | 1 | 10 | 10
2578 | 0 | 6 | 6
2584 | 1 | 17 | 17
2590 | 0 | 6 | 6
2609 | 1 | 9 | 9
2632 | 5 | 24 | 5
2714 | 5 | 24 | 5
2728 | 0 | 6 | 6
2752 | 1 | 14 | 14
2794 | 4 | 15 | 4
2826 | 0 | 7 | 7
2987 | 5 | 15 | 3
3005 | 1 | 14 | 14
3009 | 20 | 58 | 3
3047 | 4 | 17 | 4
3057 | 2 | 17 | 9
3075 | 2 | 11 | 6
3076 | 0 | 6 | 6
3102 | 0 | 6 | 6
3128 | 15 | 52 | 4
3132 | 15 | 52 | 4
3142 | 0 | 6 | 6
3187 | 22 | 49 | 2
3253 | 23 | 96 | 4
3282 | 19 | 46 | 2
3285 | 20 | 40 | 2
3346 | 0 | 9 | 9

EXAMPLE 6

Polynucleotides Differentially Expressed in High Metastatic Potential Lung
Cancer Cells Versus Low Metastatic Lung Cancer Cells

A number of polynucleotide sequences have been identified that are
differentially expressed between cells derived from high metastatic potential lung cancer
cells and low metastatic lung cancer cells. Expression of these sequences in lung cancer
tissue can be valuable in determining diagnostic, prognostic and/or treatment information.
For example, sequences that are highly expressed in the high metastatic potential cells can
be indicative of increased expression of genes or regulatory sequences involved in the
metastatic process. A patient sample displaying an increased level of one or more of these
polynucleotides may thus warrant more aggressive treatment. In another example,
sequences that display higher expression in the low metastatic potential cells can be
associated with genes or regulatory sequences that inhibit metastasis, and thus the
expression of these polynucleotides in a sample may warrant a more positive prognosis
than the gross pathology would suggest.

The differential expression of these polynucleotides can be used as a
diagnostic marker, a prognostic marker, for risk assessment, patient treatment and the like.
These polynucleotide sequences can also be used in combination with other known
molecular and/or biochemical markers.

The following tables summarize polynucleotides that are differentially
expressed between high metastatic potential lung cancer cells and low metastatic potential
lung cancer cells:

Table 9

Differentially expressed polynucleotides: Higher expression in high 
metastatic potential lung cancer cells (lib8) relative to low 
metastatic lung cancer cells (lib9) 





SEQ ID NO:


Lib8 clones


Lib9 clones


lib8/lib9




10 
1 u 


o 


10 


1 ^7 


J 


o 


5 


1 ^1 
1 J 1 


J 


n 


7 
/ 


1 O 

1 JZ 




n 

u 


n 


1 71 
I / 1 


o 


0 


o 


zuu 


1 0 


0 


14 


zj4 


J 


fi 


7 


7£7 
ZOZ 




0 
\J 


7 


771 
Z / 1 


D 




7 


T/1 O 

34o 


O 


1 


0 


4iz 




n 

u 


7 






n 


7 


!>zU 


0 


n 

\J 


c 

o 


j3U 


c 
J 


fi 


7 
/ 


JOO 




n 


7 


£71 

ozJ 


7 


fi 


10 


oi / 


7 


fi 


10 


££fi 


J 


fi 


7 


0 /o 


0 


0 


1 1 
1 1 


£ftfi 
05U 


J 


o 


7 

/ 


700 


Q 


7 
z 


A 


714 
/ 14 


Zo 


1 1 




774 


1 1 
1 1 


fi 


15 


817 
o 1Z 


J 


fi 


7 




o 


7 


6 


SEQ ID NO: | Lib8 clones | Lib9 clones | lib8/lib9
901 | 11 | 2 | 8
1168 | 5 | 0 | 7
1333 | 6 | 0 | 8
1352 | 5 | 0 | 7
1524 | 11 | 1 | 15
1706 | 5 | 0 | 7
1752 | 17 | 9 | 3
1768 | 20 | 4 | 7
1769 | 5 | 0 | 7
1780 | 6 | 0 | 8
1781 | 40 | 3 | 19
1799 | 6 | 1 | 8
1803 | 6 | 1 | 8
1811 | 16 | 9 | 2
1884 | 6 | 0 | 8
1919 | 8 | 1 | 11
1939 | 6 | 0 | 8
1975 | 43 | 9 | 7
2024 | 12 | 1 | 17
2045 | 8 | 1 | 11
2060 | 20 | 13 | 2
2071 | 16 | 4 | 6
2128 | 5 | 0 | 7
2177 | 10 | 2 | 7
2181 | 44 | 13 | 5
2184 | 11 | 1 | 15
2185 | 10 | 4 | 3
2283 | 7 | 0 | 10
2311 | 10 | 4 | 3
2314 | 10 | 0 | 14
2393 | 14 | 6 | 3
2398 | 6 | 1 | 8
2460 | 10 | 4 | 3
2514 | 6 | 0 | 8
2597 | 5 | 0 | 7
2657 | 8 | 2 | 6
2669 | 6 | 1 | 8
2670 | 6 | 1 | 8
3047 | 21 | 3 | 10
3050 | 16 | 5 | 4
3092 | 7 | 1 | 10
3140 | 181 | 119 | 2
3157 | 5 | 0 | 7
3187 | 16 | 5 | 4
3210 | 5 | 0 | 7
3220 | 28 | 4 | 10
3236 | 7 | 1 | 10
3249 | 16 | 0 | 22
3264 | 8 | 2 | 6
3305 | 7 | 0 | 10
3309 | 20 | 0 | 28
3318 | 24 | 4 | 8
3330 | 5 | 0 | 7
3331 | 5 | 0 | 7


Table 10

Differentially expressed polynucleotides: Higher expression in low metastatic lung cancer
cells (lib 9) relative to high metastatic potential lung cancer cells (lib 8)

SEQ ID NO: | Lib 8 clones | Lib 9 clones | lib 9/lib 8
24 | 3 | 20 | 5
53 | 0 | 18 | 13
64 | 0 | 8 | 6
70 | 0 | 11 | 8
105 | 10 | 66 | 5
129 | 0 | 16 | 11
214 | 1 | 14 | 10
233 | 4 | 35 | 6
237 | 0 | 13 | 9
264 | 0 | 29 | 21
329 | 2 | 17 | 6
368 | 1 | 37 | 26
370 | 0 | 11 | 8
418 | 0 | 8 | 6
450 | 0 | 9 | 6
461 | 0 | 9 | 6
484 | 0 | 26 | 19
494 | 0 | 41 | 29
517 | 1 | 12 | 9
522 | 1 | 11 | 8
581 | 1 | 17 | 12
614 | 3 | 23 | 5
706 | 0 | 11 | 8
726 | 5 | 23 | 3
806 | 0 | 14 | 10
824 | 0 | 9 | 6
836 | 1 | 14 | 10
874 | 0 | 12 | 9
900 | 5 | 21 | 3
1017 | 2 | 14 | 5
1144 | 0 | 8 | 6
1154 | 0 | 12 | 9
1166 | 2 | 45 | 16
1170 | 1 | 13 | 9
1302 | 2 | 13 | 5
1326 | 1 | 13 | 9
1327 | 1 | 13 | 9
1367 | 0 | 12 | 9
1377 | 0 | 12 | 9
1437 | 2 | 18 | 6
1442 | 1 | 14 | 10
1466 | 0 | 13 | 9
1476 | 0 | 13 | 9
1495 | 0 | 8 | 6
1496 | 1 | 13 | 9
1664 | 38 | 253 | 5
1682 | 1 | 17 | 12
1687 | 0 | 9 | 6
1758 | 0 | 8 | 6
1817 | 4 | 18 | 3
1837 | 3 | 16 | 4
1845 | 3 | 23 | 5
1856 | 2 | 17 | 6
1910 | 1 | 18 | 13
2146 | 2 | 16 | 9
2156 | 0 | 9 | 6
2463 | 0 | 12 | 9
2724 | 10 | 38 | 3
2749 | 403 | 2000 | 4
2801 | 6 | 25 | 3
2993 | 3 | 18 | 4
3080 | 0 | 10 | 7
3107 | 3 | 23 | 5
3292 | 0 | 20 | 14
3324 | 110 | 548 | 4

EXAMPLE 7

Polynucleotides Differentially Expressed in High Metastatic Potential Colon
Cancer Cells Versus Low Metastatic Colon Cancer Cells

A number of polynucleotide sequences have been identified that are
differentially expressed between cells derived from high metastatic potential colon cancer
cells and low metastatic colon cancer cells. Expression of these sequences in colon cancer
tissue can provide diagnostic, prognostic and/or treatment information. For example,
sequences that are highly expressed in the high metastatic potential cells can be indicative
of increased expression of genes or regulatory sequences involved in the metastatic process.
A patient sample displaying an increased level of one or more of these polynucleotides may
thus warrant more aggressive treatment. In another example, sequences that display higher
expression in the low metastatic potential cells can be associated with genes or regulatory
sequences that inhibit metastasis, and thus the expression of these polynucleotides in a
sample may warrant a more positive prognosis than the gross pathology would suggest.

The differential expression of these polynucleotides can be used as a
diagnostic marker, a prognostic marker, for risk assessment, patient treatment and the like.
These polynucleotide sequences can also be used in combination with other known
molecular and/or biochemical markers.

The following table summarizes identified polynucleotides with differential
expression between high metastatic potential colon cancer cells and low metastatic
potential colon cancer cells:

Table 11

Differentially expressed polynucleotides: Higher expression in low metastatic colon cancer
cells (lib 2) relative to high metastatic potential colon cancer cells (lib 1)

SEQ ID NOs: | Lib 1 clones | Lib 2 clones | lib 2/lib 1
429 | 0 | 9 | 10
1494 | 0 | 8 | 9
1923 | 34 | 114 | 4
1986 | 3 | 12 | 4
2018 | 0 | 9 | 10
2036 | 2 | 10 | 5
2049 | 8 | 25 | 3
2135 | 24 | 87 | 4
2146 | 2 | 16 | 9
2208 | 6 | 27 | 5
2215 | 2 | 11 | 6
2239 | 1 | 10 | 11
2307 | 2 | 12 | 6
2313 | 28 | 62 | 2
2357 | 5 | 14 | 3
2360 | 3 | 21 | 8
2362 | 0 | 6 | 6
2378 | 3 | 12 | 4
2569 | 3 | 20 | 7
2571 | 0 | 6 | 6
2588 | 54 | 172 | 3
2592 | 15 | 41 | 3
2611 | 0 | 6 | 6
2636 | 0 | 9 | 10
2641 | 7 | 20 | 3
2650 | 0 | 9 | 10
2662 | 0 | 9 | 10
2674 | 4 | 13 | 4
2682 | 0 | 6 | 6
2702 | 9 | 25 | 3
2704 | 8 | 23 | 3
2715 | 2 | 12 | 6
2804 | 9 | 22 | 3
2821 | 13 | 29 | 2
2840 | 1 | 8 | 9
2846 | 2 | 15 | 8
2866 | 0 | 6 | 6
2906 | 0 | 6 | 6
2915 | 44 | 109 | 3
2933 | 0 | 6 | 6
2935 | 5 | 16 | 3
2957 | 1 | 11 | 12
2959 | 3 | 27 | 10
2977 | 16 | 30 | 2
2980 | 12 | 27 | 2
3000 | 2 | 13 | 7
3009 | 12 | 29 | 3
3115 | 0 | 7 | 8
3156 | 502 | 2170 | 5
3210 | 2 | 21 | 11
3211 | 0 | 9 | 10
3213 | 0 | 7 | 8
3235 | 2 | 12 | 6
3251 | 2 | 12 | 6
3296 | 3 | 12 | 4
3335 | 1 | 8 | 9



EXAMPLE 8

Polynucleotides Differentially Expressed in High Metastatic Potential
Colon Cancer Patient Tissue Versus Normal Patient Tissue

A number of polynucleotide sequences have been identified that are
differentially expressed between cells derived from high metastatic potential colon cancer
tissue and normal tissue. Expression of these sequences in colon cancer tissue can provide
diagnostic, prognostic and/or treatment information. For example, sequences that are
highly expressed in the high metastatic potential cells can be indicative of increased
expression of genes or regulatory sequences involved in the advanced disease state which
involves processes such as angiogenesis, dedifferentiation, cell replication, and metastasis.
A patient sample displaying an increased level of one or more of these polynucleotides may
thus warrant more aggressive treatment.

The differential expression of these polynucleotides can be used as a
diagnostic marker, a prognostic marker, for risk assessment, patient treatment and the like.
These polynucleotide sequences can also be used in combination with other known
molecular and/or biochemical markers.

The following tables summarize polynucleotides that are differentially
expressed between high metastatic potential colon cancer tissue and normal colon tissue:

Table 12

Differentially expressed polynucleotides isolated from samples from two patients (patient 2
and patient 3): Lower expression in high metastatic potential colon tissue (patient 2: lib
17; patient 3: lib 20) vs. normal colon tissue (patient 2: lib 15; patient 3: lib 18)

SEQ ID NO: | lib 15 clones | lib 17 clones | lib 15/lib 17
69 | 19 | 1 | 3
123 | 6 | 0 | 6
140 | 24 | 8 | 3
197 | 6 | 0 | 6
198 | 113 | 0 | 121
254 | 28 | 9 | 3
412 | 28 | 9 | 3
512 | 11 | 1 | 12
641 | 17 | 7 | 3
642 | 7 | 0 | 8
954 | 12 | 3 | 4
1011 | 209 | 16 | 14
1024 | 8 | 0 | 9
1040 | 12 | 3 | 4
1055 | 26 | 7 | 4
1106 | 31 | 15 | 2
1125 | 17 | 0 | 18
1129 | 17 | 0 | 18
1138 | 109 | 0 | 117
1244 | 14 | 1 | 15
1253 | 73 | 0 | 78
1283 | 34 | 7 | 5
1285 | 34 | 7 | 5
1339 | 13 | 4 | 3
1474 | 73 | 0 | 78
1505 | 16 | [illegible] | 6
1553 | 68 | 6 | 12
1554 | 2542 | 14 | 195
1605 | 2542 | 14 | 195
1628 | 6 | 0 | 6
1643 | 142 | 4 | 38
1753 | 12 | 0 | 10
1764 | 13 | 0 | 14

SEQ ID NO: | Lib 18 clones | Lib 20 clones | lib 18/lib 20
105 | 28 | 11 | 2

SEQ ID NO: | lib 15 clones | lib 17 clones | lib 15/lib 17
198 | 21 | 0 | 18
254 | 9 | 0 | 8
412 | 9 | 0 | 8
1011 | 11 | 1 | 9
1138 | 14 | 0 | 12
1253 | 23 | 0 | 20
1643 | 18 | 0 | 15
1764 | 12 | 0 | 10
3156 | 140 | 43 | 3

Table 13 

Differentially expressed polynucleotides isolated from samples from two patients (patient 2 
and patient 3): Lower expression in normal colon tissue (patient 2:lib 15; patient 3:lib 
18)vs. high metastatic potential colon tissue (patient 2;lib 17; patient 3:lib 20). 



SEQ ID NO:    lib 15 clones    lib 17 clones    lib 17/lib 15
321           3                23               7
363           1                9                8
836           21               99               4
859           6                20               3
885           13               28               2
916           13               28               2
981           2                11               5
1226          8                70               8
1308          0                8                7
1317          29               84               3
1429          27               127              4
1442          0                9                8
1534          1                12               11
1540          12               43               3
1552          0                7                7
1556          1                9                8
1557          1                9                8
1569          2189             5122             2
1571          6                18               3
1576          3                25               8
1581          4                22               5
1601          25               157              6
1613          9                48               5
1616          15               61               4
1620          2                17               8
1622          4                99               23
1626          6                35               5
1647          4                22               5
1664          4                28               7
1683          2                18               8
1704          3                15               5
1800          0                7                7
2749          23               60               2
2784          4                14               3
2805          1                9                8
2976          3                14               4
3128          18               57               3
3129          26               124              4
3146          64               210              3
3150          940              2267             2
3151          2                15               7










SEQ ID NO:    lib 18 clones    lib 20 clones    lib 20/lib 18
865           0                5                6
1569          1                7                8
1580          1                7                8
1590          1                7                8
2790          0                5                6



EXAMPLE 9 

Polynucleotides Differentially Expressed in High Colon Tumor Potential 
Patient Tissue Versus Metastasized Colon Cancer Patient Tissue 
A number of polynucleotide sequences have been identified that are
differentially expressed between cells derived from colon cancer tissue and cells derived
from colon cancer tissue metastases to liver. Expression of these sequences in colon cancer
tissue can provide diagnostic, prognostic and/or treatment information associated with the
transformation of precancerous tissue to malignant tissue. This information can be useful
in preventing progression to the advanced malignant state in these tissues, and can be
important in risk assessment for a patient.

The following table summarizes identified polynucleotides with differential
expression between high tumor potential colon cancer tissue and cells derived from high
metastatic potential colon cancer cells:



Table 14

Differentially expressed polynucleotides:
Greater expression in metastatic colon tumor tissue (lib 20) vs.
colon tumor tissue (lib 19)

SEQ ID NO:    lib 19 clones    lib 20 clones    lib 20/lib 19
937           0                6                8
976           0                5                7
1520          1                8                11
1546          1                11               15
1550          1                11               15
1574          1                8                11
1580          0                7                9
1590          0                7                9
1599          8                21               4
1607          158              632              5
1622          1                7                9
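
The ratio column of Table 14 can be approximated from the two clone counts. The following is a minimal sketch under stated assumptions, not the method given in this specification: it assumes each count is normalized by the total number of clones analyzed in its library, and that a zero count in the denominator library is floored to one so the ratio stays finite. The function name and the library totals (4000 and 3000) are hypothetical placeholders, chosen only because a 4:3 proportion is consistent with the rounded ratios listed in Table 14.

```python
def normalized_ratio(count_num, total_num, count_den, total_den):
    """Ratio of clone frequencies between two libraries.

    Assumption: a zero count in the denominator library is floored to 1
    so that the ratio remains finite.
    """
    floored_den = max(count_den, 1)
    return (count_num * total_den) / (floored_den * total_num)


# Hypothetical library sizes (placeholders, not taken from the specification).
LIB19_TOTAL = 4000
LIB20_TOTAL = 3000

# (SEQ ID NO, lib 19 clones, lib 20 clones) rows copied from Table 14.
rows = [(937, 0, 6), (1546, 1, 11), (1607, 158, 632)]

for seq_id, lib19, lib20 in rows:
    ratio = normalized_ratio(lib20, LIB20_TOTAL, lib19, LIB19_TOTAL)
    print(seq_id, round(ratio))
```

With these placeholder totals the three rows round to 8, 15, and 5, matching the lib 20/lib 19 column for those entries; other library pairs would need their own totals.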



Table 15

Greater expression in colon tumor tissue (lib 19) than metastatic colon tissue (lib 20)

SEQ ID NO:    lib 19 clones    lib 20 clones    lib 19/lib 20
105           64               11               4
1011          53               1                40
1226          18               4                3
1571          8                0                6
1726          15               3                4
1811          17               2                6
2749          47               6                6
3146          19               2                7
3324          20               1                15

EXAMPLE 10

Polynucleotides Differentially Expressed in High Tumor Potential 
Colon Cancer Patient Tissue Versus Normal Patient Tissue 
A number of polynucleotide sequences have been identified that are
differentially expressed between cells derived from high tumor potential colon cancer
tissue and normal tissue. Expression of these sequences in colon cancer tissue can provide
diagnostic, prognostic and/or treatment information associated with the prevention of the
malignant state in these tissues, and can be important in risk assessment for a patient. For
example, sequences that are highly expressed in the high tumor potential colon cancer cells
are associated with or can be indicative of increased expression of genes or regulatory
sequences involved in early tumor progression. A patient sample displaying an increased
level of one or more of these polynucleotides may thus warrant closer attention or more
frequent screening procedures to catch the malignant state as early as possible.

The following tables summarize polynucleotides that are differentially
expressed between high tumor potential colon cancer cells and normal colon cells:



Table 16

Differentially expressed polynucleotides detected in samples from a patient (patient 2):
Higher expression in normal colon tissue (patient 2: lib 15)
vs. tumor potential colon tissue (patient 2: lib 16)

SEQ ID NO:    lib 15 clones    lib 16 clones    lib 15/lib 16
69            19               7                3
105           116              54               2
140           24               4                6
197           6                0                6
198           113              3                40
254           28               6                5
412           28               6                5
642           7                0                7
830           10               2                5
938           31               13               3
1011          209              37               6
1095          12               3                4
1125          17               0                18
1129          17               0                18
1138          109              1                115
1253          73               1                77
1283          34               13               3
1285          34               13               3
1339          13               3                5
1453          11               3                4
1474          73               1                77
1505          18               6                3
1554          2542             448              6
1605          2542             448              6
1614          36               14               3
1630          24               9                3
1643          142              2                75
1646          39               14               3
1649          24               8                3
1677          19               6                3
1753          13               0                14
1764          13               0                14
1766          177              65               3
1772          24               8                3



Table 17 

Differentially expressed polypeptides detected in samples from patient. Lower expression 
in normal colon tissue (lib 18) than colon tumor tissue (lib 19) 



SEQ ID NO:    lib 18 clones    lib 19 clones    lib 19/lib 18
3146          3                19               6
3150          21               228              10
3324          3                20               6

Table 18

Differentially expressed polypeptides detected in samples from patient. Higher expression 
in normal colon tissue (lib 18) than colon tumor tissue (lib 19) 



SEQ ID NO:


lib 18 clones


lib 19 clones 


lib 18/lib 19 


198 

1 /O 


21 


2 


12 


465 


6 


0 


7 


*T(J7 


6 


0 


7 


745 


6 


0 


7 


RSQ 


11 


2 


6 


Q76 
y / u 


7 


0 


8 


101 1 

lull 


209 


37 


6 


1045 


g 


1 


9 


1 HR 

1 1 JO 


14 


0 


16 


1753 


23 


0 


26 


1 ^09 


16 


4 


5 


1474 


23 


0 


26 


1 5RQ 

1 JO/ 


6 


0 


7 


1 5Q1 


22 


11 


2 


1607 


386 


158 


3 


164^* 


18 


0 


21 


175^ 


12 


0 


14 


1 764 


12 


0 


14 










SEQ ID NO:    lib 18 clones    lib 19 clones    lib 19/lib 18
105           28               64               2
1011          11               53               4
1226          2                18               8
1251          6                19               3
1559          1                9                8
1571          0                8                7
1608          1                9                8
1766          2                13               6
1782          1                9                8
1811          1                17               15

Table 19

Differentially expressed polynucleotides: 
Higher expression in colon tumor tissue 
(patient 2, lib 16) vs. normal colon tissue (patient 2, lib 15) 



SEQ ID NO: 


lib 15 clones


lib 16 clones


lib 16/lib 15


7 


1 
1 


y 


Q 


1 /" A 

164 


r 
0 


1 Q 

[y 


J 


734 


A 

4 




4 


836 


21 




z 


928 


2 


1 1 


c 

J 


965 


2 


1 1 


r 
J 


987 


2 


1 1 


r 

J 


1026 


7 


1 c\ 

19 


J 


1044 


4 


1 c 

16 


A 

4 


lll9 


4 


16 


4 


1 226 


8 


46 


c 

J 


1227 


0 


9 




1251 


7 


95 


1 ^ 

13 


1316 


0 


6 


c 
O 


1429 


27 


O 1 

81 


J 


1 442 


0 


9 


9 


1540 


12 


28 


2 


1553 


68 


590 


o 
5 


1560 


4 


^ A 

24 


o 


1577 


I 


10 


Q 


1 JOO 






4 


1610 


3 


13 


4 


1620 


2 


23 


11 


1626 


6 


23 


4 


1673 


2 


15 


7 


2416 


0 


7 


7 


2749 


23 


54 


2 


2976 


3 


14 


4 


3129 


26 


64 


2 


3132 


18 


54 


3


EXAMPLE 11

Polynucleotides Differentially Expressed in Growth Factor-Stimulated Human
Microvascular Endothelial Cells (HMEC) Relative to Untreated HMEC

A number of polynucleotide sequences have been identified that are
differentially expressed between human microvascular endothelial cells (HMEC) that have
been treated with growth factors and untreated HMEC.

Sequences that are differentially expressed between growth factor-treated
HMEC and untreated HMEC can represent sequences encoding gene products involved in
angiogenesis, metastasis (cell migration), and other developmental and oncogenic
processes. For example, sequences that are more highly expressed in HMEC treated with
growth factors (such as bFGF or VEGF) relative to untreated HMEC can serve as markers
of cancer cells of higher metastatic potential. Detection of expression of these sequences in
colon cancer tissue can provide diagnostic, prognostic and/or treatment information
associated with preventing progression to the malignant state in these tissues, and can be
important in risk assessment for a patient. A patient sample displaying an increased level
of one or more of these polynucleotides may thus warrant closer attention or more frequent
screening procedures to catch the malignant state as early as possible.

The following table summarizes identified polynucleotides with differential 
expression between growth factor-treated and untreated HMEC. 




Table 20 

Differentially expressed polynucleotides: 
Higher expression in untreated HMEC (lib 12) vs. bFGF treated HMEC (lib 13) 



SEQ ID NO:    lib 12 clones    lib 13 clones    lib 12/lib 13
849           6                0                6
1059          6                0                6
1206          12               2                6
3208          12               0                12

Lower expression in untreated HMEC (lib 12) vs. bFGF treated HMEC (lib 13)

2748          3                12               4
3325          0                6                6



Table 21 

Differentially expressed polynucleotides:
Higher expression in untreated HMEC (lib 12) vs. VEGF treated HMEC (lib 14)

SEQ ID NO:    lib 12 clones    lib 14 clones    lib 12/lib 14
1150          9                0                9

Lower expression in untreated HMEC (lib 12) vs. VEGF treated HMEC (lib 14)

3324          22               50



EXAMPLE 12 

Polynucleotides Differentially Expressed in Normal Prostate Cells Relative to
Prostate Cancer Cells

A number of polynucleotide sequences have been identified that are
differentially expressed between normal prostate cells and prostate
cancer cells. Expression of these sequences in prostate tissue suspected of being cancerous
can provide diagnostic, prognostic and/or treatment information. These polynucleotide
sequences can also be used in combination with other known molecular and/or biochemical
markers. The following table summarizes identified polynucleotides with differential
expression between normal prostate cells and prostate cancer cells:

Table 22

Differentially expressed polynucleotides: normal prostate cell line (lib 21) 
vs. prostate cancer cell line (lib 22) 
Higher in lib 21 



SEQ ID NO:    lib 21 clones    lib 22 clones    lib 21/lib 22
53            17               2                [illegible]
[illegible]   22               8                3
1801          7                0                7
1845          22               6                4
446           8                0                8
1410          6                0                6
2060          18               6                3
2143          12               3                4
2632          13               1                13
2899          16               2                8
3338          12               2                6



Higher in lib 22 



86            2                13               7
93            0                9                9
687           0                9                9
1269          1                15               15
1581          25               74               3
1647          25               74               3
1649          12               27               2
1710          5                16               3
1717          5                16               3
1772          12               27               2
1960          0                6                6
2987          0                6                6
3128          13               42               3
3132          13               42               3
3150          263              962              4
3222          0                6                6
3268          0                6                6

EXAMPLE 13

Polynucleotides Differentially Expressed Across Multiple Libraries 

A number of polynucleotide sequences have been identified that are
differentially expressed between cancerous cells and normal cells across two or more tissue
types tested (i.e., breast, colon, lung, and prostate). Expression of these sequences in a
tissue of any origin can provide diagnostic, prognostic and/or treatment information
associated with preventing progression to the malignant state in these tissues, and can be
important in risk assessment for a patient. These polynucleotides can also serve as
non-tissue-specific markers of, for example, risk of metastasis of a tumor. The following
polynucleotides were differentially expressed but without tissue type-specificity in at least
two of the breast, colon, lung, and prostate libraries tested: 53, 105, 355, 412, 614, 836,
1442, 1581, 1647, 1649, 1664, 1772, 1782, 1811, 1845, 1856, 1875, 1923, 2060, 2071,
2135, 2146, 2239, 2313, 2378, 2393, 2416, 2460, 2490, 2632, 2674, 2704, 2724, 2749,
2784, 2804, 2959, 2976, 2977, 2980, 2987, 3009, 3047, 3128, 3129, 3132, 3146, 3150,
3156, 3210, 3324, 3331, and 3335.
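
The selection described in this example (sequences differentially expressed in at least two tissue types) can be sketched as a simple tally over per-tissue hit lists. This is an illustrative sketch only; the function name, the dictionary layout, and the choice of which SEQ ID NOs to include are assumptions made for the example. The colon entries are a small subset of Table 13 and the prostate entries a small subset of Table 22.

```python
from collections import Counter


def shared_across_tissues(hits_by_tissue, min_tissues=2):
    """Return SEQ ID NOs that appear in at least `min_tissues` tissue types."""
    counts = Counter()
    for seq_ids in hits_by_tissue.values():
        # set() ensures each tissue type contributes at most once per sequence
        counts.update(set(seq_ids))
    return sorted(s for s, n in counts.items() if n >= min_tissues)


# Illustrative subsets only (not the complete tables).
hits = {
    "colon": {321, 1581, 3150},    # subset of Table 13
    "prostate": {86, 1581, 3150},  # subset of Table 22
}

print(shared_across_tissues(hits))
```

For this subset the function returns SEQ ID NOs 1581 and 3150, both of which appear in the list above.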

Those skilled in the art will recognize, or be able to ascertain, using not 
more than routine experimentation, many equivalents to the specific embodiments of the 
invention described herein. Such specific embodiments and equivalents are intended to be 
encompassed by the following claims. 

All publications and patent applications cited in this specification are herein
incorporated by reference as if each individual publication or patent application were
specifically and individually indicated to be incorporated by reference. The citation of any
publication is for its disclosure prior to the filing date and should not be construed as an
admission that the present invention is not entitled to antedate such publication by virtue of
prior invention.

Although the foregoing invention has been described in some detail by way
of illustration and example for purposes of clarity of understanding, it is readily apparent to
those of ordinary skill in the art in light of the teachings of this invention that certain
changes and modifications may be made thereto without departing from the spirit or scope
of the appended claims.

Deposit Information: 

The following materials were deposited with the American Type Culture
Collection (ATCC); CMCC = Chiron Master Culture Collection:

cDNA Libraries Deposited with ATCC

Tube Number    Deposit Date    ATCC Accession No.    CMCC Accession No.
ES 137         May 30, 2000
ES 138         May 30, 2000
ES 139         May 30, 2000
ES 140         May 30, 2000
ES 141         May 30, 2000
ES 142         May 30, 2000
ES 143         May 30, 2000
ES 144         May 30, 2000
ES 145         May 30, 2000
ES 146         May 30, 2000
ES 147         May 30, 2000
ES 148         May 30, 2000
ES 149         May 30, 2000
ES 150         May 30, 2000
ES 151         May 30, 2000
ES 152         May 30, 2000
ES 153         May 30, 2000
ES 154         May 30, 2000
ES 155         May 30, 2000
ES 156         May 30, 2000
ES 157         May 30, 2000
ES 158         May 30, 2000
ES 159         May 30, 2000
ES 160         May 30, 2000
ES 161         May 30, 2000
ES 162         May 30, 2000
ES 163         May 30, 2000
ES 164         May 30, 2000
ES 165         May 30, 2000
ES 166         May 30, 2000
ES 167         May 30, 2000


Table 23 lists the clones for each deposit, designated by "tube" number.
This deposit is provided merely as a convenience to those of skill in the art, and is not an
admission that a deposit is required under 35 U.S.C. §112. The sequences of the
polynucleotides contained within the deposited material, as well as the amino acid
sequences of the polypeptides encoded thereby, are incorporated herein by reference and are
controlling in the event of any conflict with the written description of sequences herein. A
license may be required to make, use, or sell the deposited material, and no such license is
granted hereby.



Retrieval of Individual Clones from Deposit of Pooled Clones

Where the ATCC deposit is composed of a pool of cDNA clones, the
deposit was prepared by first transfecting each of the clones into separate bacterial cells.
The clones were then deposited as a pool of equal mixtures in the composite deposit.
Particular clones can be obtained from the composite deposit using methods well known in
the art. For example, a bacterial cell containing a particular clone can be identified by
isolating single colonies, and identifying colonies containing the specific clone through
standard colony hybridization techniques, using an oligonucleotide probe or probes
designed to specifically hybridize to a sequence of the clone insert (e.g., a probe based
upon unmasked sequence of the encoded polynucleotide having the indicated SEQ ID NO).
The probe should be designed to have a Tm of approximately 80°C (assuming 2°C for each
A or T and 4°C for each G or C). Positive colonies can then be picked, grown in culture,
and the recombinant clone isolated. Alternatively, probes designed in this manner can be
used in PCR to isolate a nucleic acid molecule from the pooled clones according to
methods well known in the art, e.g., by purifying the cDNA from the deposited culture
pool, and using the probes in PCR reactions to produce an amplified product having the
corresponding desired polynucleotide sequence.
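
The Tm estimate described above (2°C for each A or T and 4°C for each G or C, commonly called the Wallace rule) can be checked for a candidate probe with a few lines of code. This is a minimal sketch; the function names, the 4°C tolerance, and the example probe sequence are hypothetical and are not taken from the deposited clones.

```python
def wallace_tm(probe):
    """Estimate Tm in degrees C as 2 per A/T plus 4 per G/C."""
    seq = probe.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("probe must be a non-empty string of A, C, G, T")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def near_target(probe, target=80, tolerance=4):
    """True if the estimated Tm is within `tolerance` degrees of `target`."""
    return abs(wallace_tm(probe) - target) <= tolerance


# Hypothetical 26-mer: 12 A/T and 14 G/C bases.
example_probe = "ATGCATGCATGCATGCATGCATGCGC"

print(wallace_tm(example_probe), near_target(example_probe))
```

For the example 26-mer the estimate is 2 x 12 + 4 x 14 = 80°C, so it meets the approximate target stated above.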

Table 23 



Clone Name 


Tube 


M00001351A:B02 


iES 


137 


M00001356A:H1! 


ES 


137' 


M6M36'3D:D09' 


ES 


137 


M00001395D:H02 


:ES 


137 


M0600i439C:H06 


jES 


137 


M00001476B:G10 


ES 


137 


M00001582A.-E02 


ES 


137 


M00003750D:E06 


;ES 


137 


M00003761C:F02 


IES 


137 


^6603770X105 


" ES 


137 " 


M0bW3786A:AlT 


ES 


137 


mM63800a7F09~ 


JES 


137 


M00003816D:E11 


IES 


137 


ta60003902A:C03 


"|ES 


T37~ 



Clone Name 


Tube 


M00003991C:F06 


ES 


137 


M00003995B.E03 


ES 


137 


M00004046C:A08 


ES 


137 


M00004105D:D05 


ES 


137 


M60004r39B:Bi6 


ES 


137 : 


M00004140D:C03 


ES 


137 ; 


M00004i44A:H65~ 


ES 


137 ' 


'"M00004i52A:C12 


ES 


137 ; 


M00004155D:A10 


ES 


137 1 


"M66664"f68A:Gn """ 


ES 




M00004197B:H10 ~ 


Is 


137 j 


M00004222C:E03 


ES 


137 j 


M00004234A:E07 


ES 


137 ; 


M00004239B:F11 


ES 


137


Clone Name



Tube 



M00004241B:H07 



M00004264B:A05 
M00004278A:F09 



M00004282D:C11 
M00004308C:C06 
M00004340C.C07 
M00004354D:E05 



JESJ37 

:ES 137 
ES 137 

~ Is 137 
ES 137 
ES 137 

!ES 137 



M00004361A:H02 |ES_1_37 
M00004372B:F07 *ES 1 37 
M00004378A-B10 ES F37 



M00004393B:E07 IES 137 



M00023282A:C02 

M00023300D:Ci 1 

M00623316CTg68" 

M00023333D:C12 

M00023352B:F03 

M00023352D:HOT 

M00023376B:G04 

M00023377B:FOI 

M 00023 39 8B:D 12 

M00023399C:E10 

Mbo626863A:F08 

M6b026843B:Dld" 



jES 137 
•ESl37 



M00026850D:F09 

M00026851B:F01 

M0d026856D:F02 

M00026857D:G12 

M00026859D:D01 

Mb6026860B:C05 



ES 137 
ES 137 
ES 137 
]lS 137 
,ES 137 
ES 137 
TES 137 
ES 1 37 
ES 137 
ES 137 
ES 137 
ES 137 
ES 137 
ES 137 
ES 1 37 



M00026865B.A06 



M00026868C:E1 1 



M00026878A:F05 



M00026882D.G09 



M00026885A:H09 



M00026901A:G07 



M00026914A:H10 
M00026915B:C06 



ES 137 



ES 137 



ES 137 



ES 137 



ES 137 



ES 137 



ES 137 



ES 137 
EST37 



M00026918B:D01 IES 137 



M00026922C:B02 
M0W6922C:GO3 
Mdb26926A~ErO 



ES 137 
;EST37 
iES 137 



M00026927D:F02 
m66626928D:A03 



M00026935C:B04 



IES 137 
1ESJ37 
TlS 137 



Clone Name 



Tube 



M00026941D:A04 ES 137 



M00026944B.E03 
M00026946A:F12 
r M00026980A:D09 
M00027016A:B06 
M000270I8A:C09 
M00027021A:G02 



M00027022D:G11 
M00027030C.H06 



| ES 137 
!lS 137 
ES 137 
ES 137 
'ES137 
[ES]37 
•ES 137 
ES 137 
ES 137 



ES 137 
: ES 137 

Yes f37 

IES 137 
^ES 137 
; ES 137 
iES 137 



M00027035D:C06 
M00027049B:F05 
rM00627b78AiBu2 
M0d62"7086A:B0l" 
M00027085C:E11 
M06027094A:B03 
Mb0027f03B:A09 
M00027108C:B03 

M00027121D:C05 ! ES 137 
M00027135A:B if [ES 137 

M00027136C:C09 JES 137 

M00027141C:H03 1 ES 137 
M06027T59b:Fb3 "ISOY 

M00027162B:F05 ES 1 37 

M00027178B:G09 ES 1 37 

M00b27l79D:E06 ES 138 

iM00027181D:A05 iES 138 
M00027195C:E04 ES 138 

"M6b027i98B:B08 IES 138 



M00027200A:F02 


ES 


138 


M00027207B:F07 


ES 


138 


Mb00272i2D:Eb3 


Is 


138 


M00027228D:A01 


ES 


138 


Mb0027232D:B08 


ES 


138 


Mbbb27233B:Cbl 


ES 


138 


M00027236A:E04 


ES 


138 


M00027237C:B08 


ES 


138 ~ 


M00027248A:C02 


Is 


138 


M00027256B:H09 


ES 


138 


M00027258A:A07 


ES 


138 


M00027263A.-F10 


ES 


138 


M00027292D:F10 


ES 


138 


M00027297A:C04 


Is 


138 ' 


M00027299B:B12 


Is 


138 " 


M00027301A:G05 


ES 


138


Clone Name



Tube 



M00027301B:B08 



«M00027314C:D09 
iM00027319D:Bll 



IESJ38 
;ES 138" 
ES 138 



!M60027324D:C05 | ES 1 38 

\ M00027347C:G07 ES 1 38 
; M00027355 A:B07 ES 138 

;M00027359B:G05 ES 138 



PM00027366A:F11 
;M00027379C:B07 
iM00(^92BlH02 



!M00027396D:G08 



jM00027398C:F07 
jM00b27438C:G07 
M"00027462A:D07 
M00027462B-.H07 



ES 138 
ES 138 
ES 138 



ES 138 



ES 138 
ES 138 



ES 138 
ES 138 



M00027468A:C09 



M00027475B:E10 
MOo627476A : co9 
M6602imA:¥06 
M00027520A:C05 
'M00627525B:D06 
> m60027526D:F03 
M00027528C:B10 
M00027537C:B01 
M00027546C:B10 
M00027591B:C04 
M00027596A:A10 
. M00027596C:E06 
M00027602B:C01 



ES 138 



ES 138 
EST38 
'ES 138 
• ES 138 
; ES 138 
r ES 138 
TeS 138 
ES 138 
ES 138 
ES 138 
[ES 138 
•ES 138 
!ES 138 



M006276i5A:F10 



!ES 138 



M06027617B:C12 I ES 138 



M"6b027620D:Fll t ES 138 



M00027625A:H01 jES 138 



r M60027634A:Dll 


ES 


138 


M00b2764iC:A03 


ES 


138 


M0b027647C:D63 


ES 


138 


M00027652B:Fli 


ES 


138 


'M00627668C:H12 


ES 


138 


"M00(^7729D:H66 


ES 


138 


M06027733A:A62 


ES 


138 


M00027741B:F09 


ES 


138 


> Mb0027743A:C03 


ES 


138 


M00027801C:Cil 


iES 


138 


r M000278i3C:F6l 


jES 


138 



Clone Name 



Tube 



M00027818C:C07 ES 138 



:M00b27836D:F12 
|M00027837CnD692 
tM^0O2812OD:F12 
:M00028066C:D07 
;M6b028184D:GlO 
M00028185B:A06 



ES 138 
ES138 



: ESJ38 
ES 138 
ES 138 
ES 138 



;M00028196D:A03 
iM00b282blB:H12~ 
|N^b02^7D:E09"" 
tMM0282fOB;D02' 



iES 138 
ES 138 

jlsW 

ES 138 



M00028354A:B12 ES 138 
MOO 02 83 54D:A03 ES 1 38" 
M00028357A:G10 ; ES 138 



,M00028212C:B08 ES 138 
! M00028215D:F03 ES 138 



M00028220A:B04 ES 138 
"M66b28314D:F05 ES 138 
M000283T6B:Hi2 ES 138 



M00028362A:G11 


ES 


138 ; 


M00028364C:G08 ' 


ES 


138 : 


M00028369D:E08 ! 


ES 


138 : 


M0l)b28617C:A12 ! " "[ 


Is" 


138 | 


'Mbb028768C:D05 I 


ES 


138 \ 


• M00028770A:D04 ! 


ES 


138 I 


M00028772C:B09 


ES 


138 


i M00028775D:F03 


ES 


138 1 


M0b028777B:G12 


ES 


138 ; 


M0bb31368A:E10 


ES 


138 I 


M00b314i7C:G09 


ES 


138 1 


M00b31419D:C04 


ES 


138 i 


'M00031485D:G62 


ES 


138 ! 


Mb003248bB:Elb 1 


ES 


139 | 


M00032492A:Cbi 


ES 


139 i 


M00032495B:D02 


ES 


139 i 


M00032499C:A01 


ES 


139 


'MObb32508B:H03 


ES 


139 


M00032510D:F12 


ES 


139 


M00032510D:G06 


iES 


139 


M00032513D:F01 


;ES 


139 


"Mb0032530D:C02 


lis 


139 ' 


M00032535D:H01 


jES 


139 


M00032539B:C11 


ES 


139 


M00032540A:A09 


ES 


139


Clone Name


Tube 


M00032541D:H08 




139 


M00032545B^H09 


ES 


139 


MUUUJzMDD.UlO 


ES 


139 


M00032550D:C02 


ES 


139 


M00032551B:G05 


ES 


139 


M00032577A:C04 


ES 


139 


M00032578A.G06 


ES 


139" 



M00032584A:H08 -ES 139 
M00032592A:H1 1 _ \ ES 1 3<T 
M^2597qB6[ I ES 139 
M00032638C:G08 TES 139 



M00032638D:A06 ES 139 



M00032668D:G12 



M06032678C:D06 
M0003T6l8D:DlT' 



M00032712B:G02 
m66632724A. ; C05 
M0d032725C:F06 
M0b032726C:C0T 



!ES 139 
"(ES 139 
iES 139 

ES 139 
'ES 139 

ES 139 
i ES 139 



M00032731B:C10 
M00b3273iC:C07 
M66b32737B:E09 
M00032739A.A06 
M0b032744B:F10 
M00032766B:D12 
M00032766C:A04 
M00032790B:A07 
M0b032793A:F66 
Mbb032797B:G02" 



;ES 139 
"ES 139 
ES 139 
'ES 139 
TES 139 
ES 139 
ES 139 
ES 139 
ES 139 
ES 139 



M00032808B:G10 
Mb0032811B:b02 



ES 139 



ES 139 



M00032829B:E06 |ES 139 



M00032830D.G03 !ES 139 



M00032831C:G07 



M00032853D:G12 
M00032864B-B09" 



M00032871D:E1 



M00032876C.D06 



M00032907A:G04 



M00032909A:B06_ 
M00032917D:G09 



M00032918B:D08 
M0b632918B:Eb6" 



M00032918C:B10 



ES 139 



ES 139 
ES 139 



ES 139 



ES 139 



ES 139 



ES 139 



ES 139 



ES 139 



ES 139 



ES 139 



Clone Name 



Tube 



i M00032921B:H08 ; ESJ39_ 
rM00032933A^Cl6_rES 139 

M00032939B:E07 ES 139 

: M00032940A:C02 ; ES 1 39 

M00032942b:C12 " "ES 139 

M00032944B.B02 ES 139 

M00032984C:G05 ES 139 



!M00032990B:A11 
M00032994A:Ab8 



|ES 139 
ES 139 



M00032995C:C05 


ES 


139 ; 


M00033007C:E01 


ES 


139 j 


- — — ■ — 

M00033019B:E10 


ES 


139 


M00033O33C:H01 


ES 


139 


M00033034C:A06 


ES 


139 ; 


M00033034C:F02 


ES 


139 ! 


Mbb033037D:Cii 


ES 


139 ' 


M00033074A.C08 


ES 


139 " ] 


M00033130B:F06 


ES 


139 


Mb0033140b:F06 


ES" 


139 1 


M00033173D:C01 


ES 


139 ' 


Mbbb33176B:E12 "" 


ES 


139 i 


M00033186C.DU 


ES 


139 i 


M00033189D:F08 


ES 


139 ' 


M00b33202D:G06 


ES 


139 


M00033204B:A07 


ES 


139 i 


Mb0b33205A:F03 


ES 


139 ; 


M00033217B.H07 


ES 


139 1 


M00033218A:C04 


ES 


139 ; 


M00033223B:H07 


ES 


139 i 


M00033226A:A11 


ES 


139 


M00033231D:B09 


ES 


----- 


M00033231D:G10 


ES 


139 


Mb0033243B:A05 


ES 


139 


M00033246C:E08 


ES 


139 


M00033248A:B02 


ES 


139" 


M00033261C:D12 


"es 


139 


M00033262D:A11 


ES 


139 


M00033263B:G04 


ES 


139 


M00033276B:G08 


ES 


139 


"Mboo33T85crDbi 


ES 


139 


M00033288B:D12 


ES 


140 


M00033300D:H12 


ES 


140 


M00033306D:G08 


ES 


140 


M00033306D:H09 


ES 


140


Clone Name


Tube 


M00033308B:G05    ES 140
M00033343C:H08    ES 140
M00033345D:A09    ES 140
M00033346C:A05    ES 140
M00033347C:F02    ES 140
M00033349D:F05    ES 140
M00033358A:H12    ES 140
M00033362C:C05    ES 140
M00033375A:G04    ES 140
M00033376A:C12    ES 140
M00033377D:A05    ES 140
M00033410B:C09    ES 140
M00033424B:A04    ES 140
M00033424D:H12    ES 140
M00033425A:C10    ES 140
M00033427D:F01    ES 140
M00033432B:H10    ES 140
M00033437C:A07    ES 140
M00033437C:C03    ES 140
M00033442A:D06    ES 140
M00033446C:G08    ES 140
M00033446D:B02    ES 140
M00033450C:A02    ES 140
M00033451A:H01    ES 140
M00033454A:D09    ES 140
M00033457D:A05    ES 140
M00033560D:G07    ES 140
M00033561C:A02    ES 140
M00033566C:E08    ES 140
M00033570B:C08    ES 140
M00033570B:E06    ES 140
M00033570C:C10    ES 140
M00033578D:G02    ES 140
M00033581C:H10    ES 140
M00033581D:D08    ES 140
M00033583B:E06    ES 140
M00033583D:B05    ES 140
M00033584D:G11    ES 140
M00033585D:A02    ES 140
M00033588C:G04    ES 140
M00033594C:B03    ES 140
M00033595A:C11    ES 140
M00038259A:G08    ES 140
M00038259B:A02    ES 140
M00038259B:G08    ES 140
M00038259C:H09    ES 140
M00038272A:G01    ES 140



M00038272D:F11 
M00038279C:A11 

:M00038284B:H04 
M00038303A:C03 

! M00038303C:D02 



! M00038303D:EO7 



JESJ40 
ES 140 
ES 140 
ES 140 
ES 140 
ES 140 



:M00038315C:G11 
M00038325D:F12 

'M00638326B:D04 
M00038327A:C1 1 



i M00038327D:A05 
M00038328D:A03 



M00038329A:E08 



M00038387B.A07 



|ES 140 
ES 140 

jlsW 

ES 140 
ES 140 
ES 140 
'*ES 140 
ES 140 
"ES 140 
ES 140 
ES 140 
ES 140 
lis 140 
ES 140 
ES 140 
ES 140 
ES 140 
ES 140 
ES 140 
;ES 140 
jlS J40 
ES 140 
[ES 140 
:ES 140 

M0003 866 1 A: A07 ! ES 1 40 



|M00038614C:H11 
M000386i5A:1112 
tM000J8liT6D-BT2"" 
;M00^86l8C^C08 
: M00638619B:A03 
M00038620B:E09 
M00038631C:B10 



M00038631D:B02 
M00038632C:B09 
M00038633A.D07 
M00038633B:G02 



M00038635A:G09 



M00038635B:C08 



M00038638D:H03 



M00038639B:C03 
M00038639D:F07 



M00038662B:A12 ES 140 



!M00038663B:H06 



M00038663D:H10 



ES 140 



ES 140 



M00038664CE04 


ES 


140 


M00038991A:D01 


ES 


140 


M00038994A:A10 


ES 


140 


M00038995C:G08 


ES 


140 


M00038995D:E05 


ES 


140 


M00038999B:G1 1 


ES 


140 


M00038999D:C11 


ES 


140


Clone Name


Tube 


M00039UUzL):u 1 1 


c c 

ES 


1 A A 

14U 


X A AAA^AAA/Hj. A (\£L 

Muu039u04B:AvJo 


bb 


1 A A 
14U 


MUUUjyUU4t>.U 1 1 


cc 
bb 


1 vlA 

14U 


MUUUJyUlDL.HU 1 


cc 
bb 


1 A 1 


maaaiqaaapi-ra i 


ETC 

bb 


I A 1 

141 


\iAAfl1AA1 1 H.^l A 


bo 


1 /1 1 
J41 


iv/iaaa'jga 1 1 a -pag 


cc 
bb 


141 


AjfAAA^AAl in.EAl 

M00039U1 3D:r[)Z 


CC 

bb 


i/ii 
141 


AjtAAA'J AA 1 /I A • LJ 1 A 

MUUujyU 1 4A.H I U 


bb 


I41 


AvfAAAIGA 1 A Q -PA/I 


bb 


i/ii 
141 


A/fAAA'JOAl < A • C»A'7 


ES 


i/ii 
141 


A iAAAIAA ICD.nn 


CC 

ES 


t /i i 
141 


H/fAAAIAAl CD.UAfl 


ES 


I4l 


M00039015L):H04 


c c 
bb 


i/ii 
141 


\jfAAA1AA 1 C A • A AO 

iviuuujyu 1 oA. AUz 


cc 
bb 


1 A 1 

141 


ajiaaaqaai ^crv^n/i 


cc 
bb 


i/ii 
141 


A;f AAA^ AAO/1 D.D1 A 

M0UU39Uz4t>.b 1 1) 


cc 

bS 


i/ii 
141 


X4AAATAATC A .UAA 


I"" 1 O 

ES 


141 


M00039026L):ru5 


ES 


1 A 1 

141 


M00039028C:B1 1 


ES 


i/ii 
I4l 


X 4AAAT ArtTnn .P A1 

M00039030B:b()2 


ES 


I4l 


M00039036C:B05 


C c 

ES 


141 


A JAAAOAAl AD .CAT 

M00039U39B:bU3 


ES 


i/ii 
I4l 


\ JAAA1 AA1AD. CAO 

MU U 03 9 U3 9 b> . r U9 


c c 

ES 


i/ii 
I4l 


A 4AAA0 A A /I TD.DAO 

M0UU39U4zb:BUz 


ES 


i/ii 
141 


* A AAAOAAy) O D • CA 1 

MUUU3 9U43 t> . bU 1 


cc 

bb 


i/ii 
14 1 


x a aaa"j AA/i nn.r , A7 
MUUU39U49L).vjU / 


c c 
bb 


i/ii 
14 1 


IV jf AAA'S OA C A A -U 1 A 

MUUU39UJUA.H 1 U 


cc 

bS 


i/ii 
141 


\^AAA1AACOP.CA7 

M0UU39Uj2C.rU/ 


cc 

bb 


i/ii 
141 


\ jf AAA'S OAC O A • A A/1 

MUUU39UjoA.AU4 


cc 

ES 


i/ii 
141 


A yf AAA"5AAC OP.UA1 

MUUU39UjoC.HUz 


C O 

ES 


141 


M00039059C:Cj0o 


cc 
bb 


1 A 1 

141 


A>TAAA1 OA£ 1 P • CAft 


cc 

bb 


1/11 

141 


A AA A AT O A/C 1 D • r\AC 

MUUU3 y Uo3 o . DUo 


cc 
ES 


141 


\A AAA1GAA4FV14AQ 


CC 

bb 


1/11 
141 


A./f AAAlOA^C/Cri'^Aff 

MUUUJyUOOU.UUo 


cc 
bb 


1 A 1 

141 


1\ 4 A A AT G A£ fi 13 • Tl A/t 


cc 
bb 


1/11 
141 


AyfAAAlGA^QP-CAA 

MUUU3yUOoL/.bUO 


r cc 
bb 


1 A 1 
141 


\A AAA7QA7ACI-PA'} 


ES 


141 


M00039072C:C03 


ES 


141 


M00039072C:E02 


ES 


141 


M00039079A:A05 


ES 


141 


M00039080C:H06 


ES 


141 


M00039081B:G06 


ES 


141 



Clone Name 


Tube 


M00039082BA05 


PC 


1 H 1 


M00039084CG07 




141 

I *T I 


M00039084C H03 


CO 


TdT 

1 *T 1 


M00039084C:H04 


cc 


Ml 
1 


M0003 9084D D07 


CO 


1 T 1 


M00039096A:A05 


CO 


1 *T 1 


M00039096A:E07 


cc 

Co 


1 A 1 


M00039097DD06 


F<i 

CO 


141 


M00039099AH08 


cc 

CO 


1/tl 




bo 


1 A 1 
14 1 


M00039105CB08 


PC 
CO 




M00039107C:E04 ES 141
M00039108D:B06 ES 141
M00039112B:C05 ES 141
M00039118B:C05 ES 141
M00039118D:A06 ES 141




cc 
bb 


1 A 1 
141 


M000391?nrH03 


cc 
bb 


1/11 
141 


„ 


cc 
bb 


141 




— 

cc 
bb 


1 A 1 

141 


Moofno i ?4r • ho? 

lrl\J\J\Jjy 1 ^T-^.OV/^. 


cc 
bb 


i/ii 
141 


lVlV/V/V/J^ 1 Z«*tV^. JT1VO 


cc 
bb 


i /i t 
141 


MO 0 0 ^ 0 1 1 9 fs O ■ A 0 8 


CO 


141 
141 


M06039127A G l I 

l»Iv/\-/V/_/ ^/ / / K . VJ 1 1 


cc 
Co 


141 
141 


M00039127D:E10 ES 141
M00039129C:D04 ES 141
M00039133B:F08 ES 141
M00039135D:F05 ES 141
M00039135D:G02 ES 141
M00039135D:H02 ES 141
M00039139A:C09 ES 141




cc 

CO 


141 

141 


M00039140A:B08 ES 141
M00039140D:A04 ES 141
M00039140D:D09 ES 141
M00039141C:E01 ES 141
M00039142D:B11 ES 141
M00039144C:E06 ES 141


M00039147A:Ff6 


ES 


141 ! 


M00039156A:B11 ES 141
M00039158B:G12 ES 141
M00039166B:G06 ES 141
M00039167B:H09 ES 141
M00039168C:A04 ES 141


MUUUjV 1 OVA.Jb 1 Z 


I"" 1 o 

ES 


1 A 1 

141 


\yfAAA?Q 1 7A A «D 1 A 

MUUUjVI /UA.dIU 


ETC 


141 


MOAA^QI 70P F0S 


ES 


141 


iv4aaaigi 71 R-ni 1 

MUUU-JV1 / 1D.U1 1 


bi> 


i /i 1 
141 


mo nn^ qT 7 7 r • no ^ 


cc 
bo 


1 A 1 

141 


Mono/io i 70 a -no9 

1V1UUUJ 1 "! lyt\.\J\)7 


bb 


1/11 

141 


maaa^qi koa • A07 

MUUU.)" 1 OvA.rw / 


ES 


141 


AyfAAA'JOl QAQ'UAA 


ES 


141 


MUUUj:/ I yOU.rxyJ f 


co 

ES 


i/ii 
141 


MUUU J7ZUUA. 1 U 


bb 


141 


m aaa^07 1 1 a -p \ 7 

MUUU JyL 1 l/\.v^IZ 


ES 


141 


XyfAAA'^O'") 1 1 O 

MUUU jyl 1 ZCC 1 Z 


ES 


142 


Mfinrncn i ^ a -no i 

MUUUJVZ 1 J A.UU J 


ES 


142 


M0UUJ9Z 1 Jr>.rU:> 


ES 


142 


iv/fAAA / *G7 i ft a -FOi 


ES 


142 


A^AAA'JOOO 1 A ■Uftl 


ES 


142 


\yf A A AT AO O/l A -C 1 O 

MUUU39ZZ4A.b 1 Z 


ES 


142 


\yfAAA7AOOQ A .QH^ 


ES 


142 


lV^AA A Y077 A A • A 1 A 
MUUUjyZ.JUA.AlU 


ES 


142 


A >I A A Al ATI AC* • Pt AO 


ES 


142 


\A AAAIQO'} api-p 1 7 
MUUUJyZjUlJ.vJ 1Z 


ES 


142 


X JfAAA"5 AOn A . A f\1 


ES 


142 


MUUU J 9Z 3 0 A . o 1 Z 


ES 


142 


lVyfAAA^ ATI OH- A AC 

MUUUJVZjoU.AUo 


ES 


142 


1V^AAA'*G7/11 A -PI 1 


ES 


1 /I ^ 

142 


lV/f AAA^G7zlQ A *P 1 7 
MUUUjyZ^tVA.L^ 1Z 


ES 


142 


A4AAA'3Q7/lGP*P1 1 
MUUUjVZ^yU.vj 1 1 


ES 


1 jl '"I 

142 


1V4 AAA'5Q7^P-P 1 7 
MUUU-SyZjjCC IZ 


c o 

ES 


1 4 ^ 
142 


\4 AAA^07^7n>PAl 
MUUUjVZ J / LJ.V^UJ 


c o 

ES 


142 


\/f AAA1Q7^0.R*Fn£ 
MUUUJyZJOO.nUO 


ES 


142 


\yfAAAlG7^8rVRA8 
MUUUjVZjoL/.DUO 


c o 

ES 


142 


\yfAAA1Q7^nP-nni 
MUUUjyZOUV^.vJUJ 


ES 


142 


JV/fAAA7Q7A'}n* A 17 
MUUU J VZO J 17. A 1 z 


c c 

ES 


1 ylO 

142 


M AAA1Q7££ A -R07 
MUUUjVZDOA.OUZ 


ES 


1 /IO 

142 


MAAA'*Q7AAn*F 1 9 

iviuuujyzooL'.r iz- 


cc 

bo 


i /io 
142 




cc 

ES 


1 /IT 

14Z 


\4AAni077^R*P07 
MUUUjz/Z / jD.ruz 


cc 

bo 


1 /io 
14Z 


MAAmo77^nun7 

MUUUjVZ / J L/. DUZ 


cc 

bb 


14Z 


MOfi01Q774R-G07 


cc 
to 


1 A7 
14Z 


M00039276B:H09 ES 142
M00039277D:G10 ES 142
M00039279B:C11 ES 142
M00039279B:H02 ES 142
M00039279C:B08 ES 142








ES 


142 




ES 


142 


M00W9286A : C66 


ES 


T42""1 


M00039287C:A06 ES 142
M00039288C:B11 ES 142
M00039293A:H04 ES 142
M00039293B:C11 ES 142
M00039295B:D03 ES 142
M00039297C:H08 ES 142
M00039298B:B06 ES 142
M00039298B:D03 ES 142


M00039298D*b64 1 


ES 


142 


M0n0^Q799R-ri 1 7 




T47 1 


M00039306c*Cd9 


ES 


142 


Mnoo^9^oorri04 

ivi uuu j y j \j . vj 


FS 


142 




FS 


142 




FS 


142 


Mnon^9in4n*R09 


FS 


142 


MnnoidYnR r * n n r 

IVlUv/U J7JUOD.VJUO 




147 • 


? Mnnn^d^ 1 n a pa7 


FS 


147 


: IVIUUUJV-) 1 JL/.VJUH 




... ... 

1 *tZ« 


. lVlV/WV/»y 1 V7/V . \J 1 


ES 


142 


M0 001 911 RR~Rn9 




j 42 — 


M00039319B:H12 ES 142
M00039319C:A04 ES 142
M00039322A:F04 ES 142
M00039328D:D07 ES 142


j 


ES 


142 


M00039329C:B10 ES 142
M00039333D:D09 ES 142
M00039334B:E03 ES 142
M00039335A:E08 ES 142
M00039339A:H07 ES 142
M00039339C:F03 ES 142
M00039340A:D05 ES 142
M00039340B:E07 ES 142
M00039340B:G08 ES 142
M00039341C:H11 ES 142


M00019341DD07 


FS 

L/w7 


142 


M00039343B:F12 ES 142
M00039344B:G07 ES 142
M00039345A:D09 ES 142
M00039345C:C12 ES 142
M00039381C:H08 ES 142



M00039381D:C02 ES 142
M00039384C:E02 ES 142
M00039384C:F08 ES 142
M00039385B:E09 ES 142
M00039391D:F08 ES 142
M00039396D:B04 ES 142
M00039397B:H09 ES 142
M00039398A:B10 ES 142
M00039401B:D02 ES 142
M00039402B:E03 ES 142
M00039403A:G12 ES 142
M00039404B:A05 ES 142
M00039407B:G02 ES 142
M00039412D:G06 ES 142



M00039414D:G03 
M060394r5D:E6l 
M00039417A:D03 
M06639417A:E12 
M00039417B:F01 



ES 142 
ES 142 
iES 142 



M00039417C.G01 
M00039418B:D08 
M06"039426b:D(j3 
M00039422D:F04 
M0d039425C:G0i 
M00039425D:E12 
M"0b039428C:E0i 



ES 142 
ES 142 



ES 143 
ES743 



M00039430B:F12 



M00039431B:F04 



M00039432C:A01 



M00039444C:H02 



M00039452C:G09 



M00039454B:A11 

M6oT9455D:H04" 

M06d39«6XC08~ 



ES 143 
ES 143 
ES 143 
ES 143 
ES 143 
ES 143 
ES 143 
ES 143 



ES 143 



ES 143 



ES 143 



ES 143 



ES 143 



ES 143 



ES 143 



M00039458B:H11 ES 143
M00039461A:F04 ES 143
M00039465A:A08 ES 143
M00039472C:B08 ES 143
M00039475C:E10 ES 143
M00039476B:A02 ES 143
M00039477A:B03 ES 143
M00039477D:A10 ES 143



il^003961]p:Dn ESJ43 
"M00639612B^B16]7TeS 143" 
M00039612B:G05 ES 143
M00039616A:B10 ES 143
M00039616B:C01 ES 143
M00039619B:D02 ES 143
M00039631A:C10 ES 143
M00039633D:D05 ES 143
M00039636C:D11 ES 143
M00039637C:A10 ES 143
M00039652B:D05 ES 143
M00039655B:H09 ES 143
M00039655C:C07 ES 143
M00039655C:E08 ES 143
M00039660C:C10 ES 143
M00039663C:G09 ES 143
M00039664D:G07 ES 143
M00039672D:D10 ES 143
M00039673A:F09 ES 143
M00039675D:B03 ES 143
M00039675D:H05 ES 143
M00039677A:B08 ES 143
M00039681B:H09 ES 143
M00039682A:C08 ES 143
M00039682C:H11 ES 143
M00039684D:B08 ES 143
M00039685A:A08 ES 143
M00039771C:E11 ES 143
M00039773D:A09 ES 143



M00039686C:C05 ES 143
M00039686C:E06 ES 143
M00039688C:G06 ES 143
M00039689C:E08 ES 143
M00039696A:E05 ES 143
M00039697B:F11 ES 143


M0b039700B:bb2 


ES 143 


M00039702A:B12 ES 143
M00039702A:B02 ES 143
M00039705D:F02 ES 143
M00039707A:D02 ES 143
M00039710C:G03 ES 143


M0b03972bb:Db2 1 


ES 143 


M00039727C:B09 ES 143
M00039729A:A10 ES 143


M00039773D:F11 ES 143
M00039774C:A03 ES 143
M00039774C:C09 ES 143
M00039775A:A09 ES 143
M00039777C:E05 ES 143
M00039778B:G03 ES 143
M00039778C:A04 ES 143
M00039781D:D10 ES 143
M00039782A:H10 ES 143
M00039785D:G05 ES 143
M00039788A:E03 ES 143
M00039788B:A06 ES 143
M00039788C:A01 ES 143
M00039790B:D03 ES 143
M00039792A:B04 ES 143
M00039793D:C05 ES 143
M00039794A:E04 ES 143
M00039795B:H10 ES 143
M00039795D:E10 ES 143
M00039795D:G06 ES 143
M00039797C:G05 ES 143
M00039798B:B02 ES 143
M00039799A:D10 ES 143
M00039801A:H11 ES 143
M00039807A:D01 ES 143
M00039808D:H02 ES 143
M00039810A:H10 ES 143
M00039813B:B01 ES 144
M00039813B:D11 ES 144
M00039815C:F09 ES 144
M00039816B:D04 ES 144
M00039816C:D05 ES 144
M00039820A:F11 ES 144
M00039820A:H11 ES 144
M00039820B:B06 ES 144


MUUUiyoZ /D.rU/ 


pc 


144 


M00039828B:C05 ES 144
M00039832A:B12 ES 144
M00039835A:F07 ES 144
M00039838A:F05 ES 144
M00039839B:B01 ES 144
M00039839C:E05 ES 144
M00039847A:F06 ES 144
M00039851B:G11 ES 144



M00039851C:D12 ES 144
M00039854B:F09 ES 144
M00039855C:F01 ES 144
M00039857B:G10 ES 144
M00039859A:F06 ES 144
M00039859C:G10 ES 144
M00039864A:A07 ES 144
M00039866B:A08 ES 144
M00039869B:F06 ES 144
M00039875D:A10 ES 144



! M00039876D:H09 ES 144 
tM00^877C:C6T iESl44 



!M00039879C:F05 ES 144 



M00039879D:B1 1 !ES144 



M00039880A:H1 1 



M00039884A:H11 
Mb0039885C. : D0Y" 



ES]44 
ES 144 
ES 144 



ES 144 
ES 144 



M00039887C:E07_ 
M00d39887D:C04 

M00039888B:D03 ES 144
M00039890A:H05 ES 144
M00039894C:H07 ES 144
M00039896C:H01 ES 144
r M00039897b:C10 'ES 1 44 

M00039898A:A08 ES 144 

M00039898b:C06 ES 144 

M00039903A.H07 ES 144 

M00639903C:D01 ' ES T 44 



M00039903C:F03 ES 144
M00039909C:G05 ES 144
M00039909D:C02 ES 144
M00039910C:G10 ES 144
M00039914D:G12 ES 144
M00039915D:C11 ES 144
M00039927A:F04 ES 144
M00039928B:G05 ES 144
M00039936C:C05 ES 144
M00039938C:A08 ES 144
M00039938C:E11 ES 144
M00039940A:D07 ES 144
M00039940D:G08 ES 144
M00039973D:C08 ES 144
M00039973D:D12 ES 144
M00039975C:C1 ES 144



M00039976D:A12 ES 144
M00039978A:G03 ES 144
M00039981A:E08 ES 144
M00039982C:H04 ES 144
M00039983D:A06 ES 144
M00039984A:C02 ES 144
M00039984B:G12 ES 144



|ES 144 

:esT44 

;ES 144 
!ESl44 
r ES 144 



M00039984D:G12 ES 144
M00039987A:F09 ES 144
M00039987C:E12 ES 144
M00039987C:G08 ES 144
M00039988A:E06 ES 144
M00039990C:D10 
M00040004DTb03" 
M00040005B:C11 
M6004flWd5DiB07" 
M06040607D:A06 

Woo4o6o9d7bo"7 

M00040010A:Flb 
jM00040014B:p6l 
fM00040014D:DfO 
jM660406T4D-F03~ 
;M00040015C:F08 
!M000406i6C:H12 
!M00040017A:C06 
M00040017b:G03 
M00040019A.E01 
M06646021A:F09 



IES 144 
'ES 144 



M00040040A:A06 



M00040041C:C09 



M00040042B£AJ0 
M66040047C:F0r 



M00040052D.F1 2 



! ES 144 
|ES"i44 
iES 144 
"[ES"l44 
:ES 144 
ES 144 
ES 144 
ES 144 
1 ES 144 



M00040022C:D06 | ES 144 
M00040026B:F06 
Mb6640029A:B03 
M00040029A:G04 



|ES 144 
iES 144 



ES 144 



M00040031A:E06 ES 144
M00040032A:B03 ES 144
M00040032A:D09 ES 144
M00040037A:E11 ES 145
M00040038D:G04 ES 145
M00040039D:D06 ES 145



IES 145 
TES145" 



IES 145 



IES 145 
TesT45" 



M00040055D:A06 ES 145



Clone Name 



M00040055JD:B01 
"M0004006bC:H10" 
; M00040062B^B05 
r Mbb040070B:B07" 
r M0004007l"B:A10" 

M00040072C.G09 

M00040076C:b06" 
;M6qb40077D:Cil 

M0004008bC:C06~ 
>M00040081C:EOI 

M00040085D:A10 
r M00040085b:Eb4~ 

M00040087D:F08 
:M(^61b0l8C:E10" 

M00040089A:G08 



Tube 



ES 145 
Tes \45 
_ES 145 
ES"l45 
ES145 
ES 145 
1 ES 145 
'ES145 



|M00040089B:E04 
M06040089C:E06 



M00040090B:G09 
M0004b092B:F05 

'M00040b93B:C02 
Mbbb4bb93D:b03 

; M06040097"A:C12 
Mb0b40098C:BOf 

: Mbb040b98D:E04 
M00040098D:G12 
M00040!00C:E05 
M00040100D:B06 

1vi00040103B:H10 



ES 145 



ES 145 



ES 145 
ES145~ 
ES 145 
ES 145 
ES 145 
ES 145 



ES 145 

:ES 145 
|ES 145 



ES 145 
W\45 
ES 145 
ES 145" 
ES 145 
ES 145 
ES 145 
ES 145 
ES 145 



|M00040105C:F11 
:'Mb0046lb6B:Bb9 
fMbbb40lb7B:HbT 
M00040111C:Db5 



ES 145 



ES 145 



r M00040115B:A04 



!M00040115B:H12 
M0b040lT8D:Glb 



ES 145 



ES 145 



ES 145 



iM00040121B:C05 



ES 145 
ES145 
ES 145 



M00040122D:A02 ES 145
M00040123A:A09 ES 145
M00040124D:H01 ES 145




ES 


145 


M00040302C:A04 ES 145
M00040304B:F06 ES 145
M00040305A:D11 ES 145
M00040305C:H06 ES 145


M00040307B:rOl 


ES 


145 


H / AAA >1 AO AnP.CI A 

M0004l)3U7L:r JU 


ES 


145 


1V4AAA/1 A1 AG A -P 1 1 


ES 


145 


M000U4ozjL):UU3 


ES 


145 


\ jf A A A A A Q 0 0 F\ • UAO 

MUUUU4o3zD.HUz 


ES 


145 


/f AAAA/t C2 • Lin'7 


ES 


145 


IViVVvUjV lOrX.DVJ 


ES 


145 


A A AAAACOAOT"\.LJAQ. i 


ES 


145 


M0000530oA:UOo 


ES" 


145 


H iAAAACI C 1 P.r'AC 

M0000535 1C:CjU5 


ES 


145 


AA AAAAC2 CO/ 0 - A HO 

MUUUUj3jzC.AUZ 


ES 


145 


M00005358B:B06 


ES 


145 


\ * AAAAf 1 C A A .C\Aj1 

M00005359A:DU4 


ES 


145 


M00005379A:bU4 


ES 


145 


K AAAAACl OOQ .CAQ 


ES 


145 


M00005384A:CI 1 


ES 


145 


M000054O2B:rO8 


ES 


145" 


M00005445D:B01 ES 145
M00005449B:B10 ES 145
M00005449B:D01 ES 145


iM00005457C:"A(j3 


is 


145 


|M00005458A:F1 1 


ES 


145 


i » m i\f\f\/\r 4 AO A T IA^ 

I M00005498A:H06 


ES 


145 


: •» m AAA AC CI 1 Ct.CAA 


ES 


145 


! * jfAAAAC C"2 OTV/^ 1 A 1 
* fAAAArrrr a . A 1A 

M00005555A:A10 


ES 


145 1 


ES 


145 \ 


M000U j 5joo. Di)Z 


ES 


145 


; -\ m AAA AC/: A 1 P\ . HAC 

M000U jdU l U.DUo 


ES 


145 


AjTAAAAC^I /I o .D A 1 


ES 


145 


k inAAAC/n A . AO 

M00005623A:uO2 


ES 


145 


* lAAAAc^nn.r i o 

M00005623D:Cj12 


ES 


145 


A jf AAAAC ZOC A ./^fil 

M00005625A:COz 


ES 


145 


TVjfAAAAC/COID.IJ 1 0 

MUUUU jo /3B.D 1 z 


ES 


145 


\ jf AAAAC7TOT5 .17 AG 


ES 


145 


\ lAAAAcoAcn. r\ 1 o 


ES 


145 


A jfAAAAC QO Af *C A/1 


ES 


145 


MUUUUOjo 1 U.ruo 


ES 




145 


X JAAAA/C C flQTVO AO 


ES 


145 


AjfAAAA£ £C7P-nfK 


ES 


145 


X jf AAAA££ OAD . F\AO 

MOOOOoooUB : DUz 


ES 


145 


M00006712C:H09 ES 145
M00006809B:B09 ES 145
M00006861B:F09 ES 145
M00006866A:D07 ES 146





Tube 


MO0006886DH02 


PC 

Lo 


1 /I ^ 


M00006891CF07 


re 

bb 


1 A(L 

14o 


MOOn0fiR97 A • H02 


re 

bb 


14t) 


Tmmioo692 8 n > ■ no7 


bb 


1 AH 

140 


M0000691 5 C* F06 




140 


M00006968A G08 




1 Af. 
140 


M00006977CG04 


cc 
bb 


14o 


moooo6077D' aoi 




140 


M0000701 2D H08 

1 ▼ 1 Vy Vy Vy Vy / vy \ \S • 1 iv/vj 


ccT 
bb 


140 


M000070 13 AD09 


Ire 
bb 


140 




rc 

to 


140 


M00007108BA02 


CC 


140 


M00007 1 12C * Bl 6 

1 VI Vy Vy v/ Vy /ll ^> • XJ l V 


cc 

bS 


1 AC 

146 


iyI\J\J\J\J f l 10V_/.VJv-^ 


cc 
bb 


1 A& 

140 


M00007124DH10 

: 1V1V/V7 \J\J f i ^TL/.l 1 1 v 


cc 

bS 


146 


\AO()()Ci7\ 16 A- AO^ 

;1V1UI/UU/ l jUrV.nuJ 


ES 


1 A £. 

146 


Mnnnn7 1 40 a no? 


cc 

ES 


i a a 

146 


M0nn071 S7PF1 1 


ES 


\ A £L 

146 


i / 1 OjD.u 1 1 


cc 

ES 


\ AC. 

146 


Mnonn7 1 Qd A ROQ 


ES 


\ A C 

146 


\a nnriri 70? qc ror 

: lviuuuu i y Ly\-> .duo 


ES 


1 AC 

146 


Mnnnn7ok i fS-pno 

xy\\}\}\}\) 17*+ 1 


ES 


1 A C 

146 


m n n fi in 7 o a i fv r fi o 


ES 


1 A C 

146 


GOOD • U i 0 

iviuuwu / " / zo. n i ^ 


cc 
bS 


1 AC 

146 


M O 00 07076 A * C I 0 

; 1V1WWWV/ / s 1 \7/V . \^ 1 v 


ES 


\ A £ 

146 


, F06 


cc 
bb 


1 AC 

146 


M00007904A-G02 


cc 

bb 


1 AC 

146 


IVlUUUuOviUUJJ.lJV/J 


cc 
bb 


1 AC 

146 


IVlV/V/wV/Ov/^. . V_/ 1 1 


ES 


1 AC 

146 


M0000R04S A -HO? 


r 1 c 

bS 


1 A C 

146 


MnnonROsTA f i o 


cc 
bb 


140 


M00008063BA06 


cc 
bb 


1 A A 

140 


M00021665BF12 


bb 


1 AC 
140 


M00021671D-F12 

IvlVyVyVy^. i vj / i \ 


bb 


1 A C 
140 


M00021852DA05 


bb 


140 


l v i Vy Vy V/ ^> i ouui/tnuj 


PC 

bb 


1 A C 
140 


^M00021908DG12 

j i V I Vy V/ Vy ^» I 7 VUl/iVJ I +~ 


cc 
bb 


1 A c 
140 


^M00021019C*A10 

; lVlUVWi« 17 1 / Vy . /v 1 V7 


PC 

cb 


1 AC 
140 


M00021923CD11 

i t i vy vy vy ^ i v ^ — y » i— ^ i i 






M00021955A:H02 ES 146
M00021964C:E10 ES 146
M00021972D:C11 ES 146
M00022005C:C06 ES 146
M00022015B:B07 ES 146


mooo??os4A-hoi 

iVl UU l/Z Z \J J t/t. . n 1/ J 


cc 

bo 


1 AC 


Mooo??OR4n-ROi 

1V1UUUZZUOtL/.DU 1 


CC 

bo 


1 /1A 


M00079099BD66 


cc 
bo 


1 A & 


mooo??i oscci? 

IV1UUUZZ 1 vJv.v I 


bo 




Mono?? i Vine hoi "= 


bo 




Moob?? 1 i sr ros 

1V1UUUZZ 1 J J\~> . UuJ 


cc 
bo 


1 AC 
I HO 


M0007711RAFOS 

1V1UUUZZ 1 JOA.LvJ 


cc 
bo 


1 AC 


mooo?? 1 7sn-ni ? 

1V1UUUZZ 1 / JU.Ls 1 Z 


cc 
bo 


1 AC 
140 


M000?? 1 78RD06 

1V1UUV/ZZ 1 / O U . fvU 


cc 
bo 


1 AC 

146 


mooo??i 8ir*noi 

1V1UUUZZ IOI v^. 1 


bo 


I AC 

146 


MOOO?? i 8iR*ro? 

1V1UUUZZ 1 OJIJ.V^UZ 


cc 

bo 


1 AC 

146 


M000771 RdC-C \ 1 


cc 

bo 


1 AC 

146 


Mftftft???nr* A 1 ? 

IVlUUUzZZj JV^.M 1Z 


ES 


1 AC 

146 


\ziooo???^Af , -no6 

1V1UUUZZZ jHL.UUO 


cc 

bo 


1 AC 

146 


Mftftft???47 a *po? 

IVIUUUZZZt / rV. CfV/Z 


ES 


1 AC 

146 


X/fftftft7??^7 A -ROQ 
1VIUUUZZZJ /A.DU7 


ES 


146 


IVl U U UZ Z Z OZ I** . \J U J 


cc 

bo 


1 A C 

146 


Mftftft???64RG10 

IVIUUUZZZOtD.VJ I U 


ES 


146 


1V1UUUZZjOJ^.*J iz 


ES 


1 A C 

146 


M ft ft ft7 7 1f\ *\ lS i • A ft 1 
IVl U U UZZ J O J L/ . r\\) J 


ES 


146 


\/f ft ft ift 7 7 Y7 ^ A • R ft | < 
IVlUUuZZJ /_>/\.13U-) 


ES 


\ A C 

146 i 


Mftftb? 7 1 71 P ■ Rb7 


ES 


1 A C ' 

146 j 


Mbftft? 7 1 Q 1 R • F ft 1 
IVIUUUZZj" 1 D . L/U 1 


ES 


1 A H 

146 I 


\yTftftft071Q1 n*Fi ft 

muuuzzjv lu.r i u 


cc 

bo 


146 


Mftftft??4 1 (\ A * Aft7 
TV1UUUZZ4 I DA.nU / 


cc 
bo 


1 A C 

146 


Mboo??4?i rp ii 

IV1UV/UZZTZ. 1 L> . V-^ 1 I 


cc 
bo 


1 AC 

146 


M0ft0??411 A FO? 

IVl \J \J UZZt JJ r\ . CiVZ 


cc 
bo 


1 A C 

146 


MOfto??4i4nno6 

J VI U \J U Z Z T J T . VJ V W 


cc 
bo 


1 A C 

140 


M00077440RF01 


ES 


1 A c 

146 


\/lftftft7?444ri"nft 1 
1V1UUVZZ44T1J.VJU 1 


cc 

bo 


\ AC 

146 


Mftftft??467C-R t ? 
IVIUUUZZtU / \^.D 1 Z 


cc 
bo 


140 


MOOO?2489r*G04 


cc 
bo 


1 A C 
140 


M00072492CA02 


cc 
bo 


1 A C 

140 


M0007749SDH08 

IVl UU UZ Zt 7 J \J . n V/ o 


cc 
bo 


1 A C 

140 


M00077496RF1? 

lViUUUZZT7UJ3.Lf 1 Z 


bo 


1 40 


M00022499AB02 

lVluuV/iiT 7 7 r\ . U\7^» 


cc 
bo 


1 AC 

140 


MOOO?? S 11 A ■ A 08 


cc 
bo 


1 A C 

146 


M0ftft??s79rri i 

1V1UUUZZ J / 1 1 


cc 
bo 


140 


M00022597DA06 

IVlVVi V^^>-7 / i-v ./WW 


ES 


1 A c 

146 


Mftftft??Aft? A -FftQ 
MuUUZZOUZ/\.CU7 


ES 


146 


M00022615D:G05 ES 146
M00022634D:C08 ES 146
M00022640C:C12 ES 146
M00022641C:H05 ES 146


M000??646A t H10 


FS 146 


^7rbb7?fifi?n'Gi i 


ES 146 


MOOO??^^ RO? 


FS 146 


M bbb? ? 66RR* R 1 ? 


ES 146 


M0002267bD : H 1 1 


ES 146 


Mbbl)2267 1 B A08 


ES 146 


M00022684AC02 


ES 146 


Mbbb? ■? 7 1 \ a : nb? 


ES 147 


MObb? ? 7 1 9 A R01 


FS 147 


M00077747OF01 


FS 147 

L/O IT/ 


Mbbb?? 767R G ]) 


L/O IT/ 


M000??78Sr-Ci06 


FS 147 

L»0 IT/ 


IVl W V7V/il.Z. / 7J17.UU I 


ES 147 


M000??7QSR-ri0fi 

IVlUUv/ZZ fyjD.VJVJVJ 


FS 147 

UO IT/ 


IVIUUV/ZZ / " / D.VJV/O 


FS 147 

LvO IT/ 


. IVivuv/ZZO 1 //\.iiv/Z 


IT/ 


Mftbft??R? i r-roQ 

■ 1V1UUUZZOZ 1 v.VV7 


FS 147 

LJ IT/ 


Mooo??8?irroi 

: 1V1V/V/ l/^Z> J . V-' V 1 


ES 147 


m ob b? ? r i ftnnoi 


C/O It/ 


It 

IVlUUUZZOJtD.VJ 1 1 




C/O It/ 


f M()b022 854 A * B03 


ES 147 


j 1V1UUUZZO JOV^.AU / 


cc 147 

L«u It/ 


Mftftft??8^ftP-nft4 


FS 147 

UO It/ 


Mftftft^?88Sr*H0S 


FS 147 

1— < O IT/ 


M00b22895 A ■ H08 


ES 147 


M000??910AA06 


L/J IT/ 


m bo o? ? q? s r • A 0 8 

: IVl U \) v/ZZ ^Z J . /\v O 


FS 1 47 

L<J IT/ 


MOO b? 7 9? 8 R • P b 1 
iviuuv/ZZ"zor>.\^u i 


FS 147 

LjO It/ 


M00077910PFO? 

1V1UUI/ZZ^J>UV^.C/UZ 


FS 147 

L<lJ it/ 


Mftftft??Q18RF07 
iviuuuzz7-joo.ru / 


FS 147 

C*0 It/ 


Mbbb22968B : E02 


ES147 


M00077Q76PF04 
jvj uuuzzj/ / uv.r ut 


l-j<J IT/ 


M0bb2i2979A 


ES 147 


M00022986DH09 


ES 147 


IVl UUUZZ 77 / /A. I UU 


FS 1 47 

KjiJ IT/ 


M00021001CC08 

. lVlV/V/V/^-7 Wv I \^.\^\/0 


FS 147 

l—i <J IT/ 


m bob? iboi r • nb7 


FS 147 


M00023007AH04 


ES 147 


M00023007CE10 


ES 


\4ftftft01ft0ftP'nft» 


pre 1/17 

CO 14 / 


M00023024D:F12 ES 147
M00023032A:B05 ES 147
M00023039D:B05 ES 147
M00023042D:D02 ES 147


M00023044B:D02 ES 147
M00023094A:B11 ES 147


XJ AAAOO 1 f\f\ A .CH 

M0002J 10uA:hlz 


ES 


147 


M00039181D:E05 ES 147
M00039184A:D03 ES 147
M00039184B:B09 ES 147
M00039361B:H01 ES 147
M00039363A:C09 ES 147
M00039366C:B07 ES 147
M00039367B:H02 ES 147
M00039371B:H06 ES 147
M00039372C:D12 ES 147
M00039374B:B07 ES 147
M00039374C:H12 ES 147
M00039374C:H02 ES 147
M00039376D:H07 ES 147
M00039377D:E12 ES 147
M00039378D:H07 ES 147
M00039379A:B03 ES 147
M00039380C:C09 ES 147
M00039482B:G02 ES 147
M00039493A:C04 ES 147
M00039496B:D08 ES 147
M00039496B:H09 ES 147
M00039497C:C06 ES 147
M00039499C:A04 ES 147
M00039500C:C04 ES 147
M00039505C:E03 ES 147
M00039508A:C12 ES 147
M00039508C:G01 ES 147
M00039510C:G02 ES 147
M00039512C:D06 ES 147
M00039515A:A06 ES 147
M00039515D:C11 ES 147
M00039517B:G12 ES 147
M00039521A:A02 ES 147


K vf A A AT A. CI 1 n.Uftl 

MUUU3952 1 L):HUJ 


ES 


147 


M00039528B:B12 ES 147
M00039529C:D07 ES 147
M00039530B:E02 ES 147
M00039533A:C12 ES 147
M00039533B:G08 ES 147
M00039533D:F04 ES 147
M00039535D:D10 ES 147


M00039536C:C10 ES 147
M00039536C:H11 ES 147
M00039561A:B07 ES 147
M00039561B:A09 ES 147
M00039562B:G02 ES 147
M00039564B:C01 ES 147
M00039570A:D10 ES 147
M00039570B:D10 ES 147
M00039584C:C01 ES 147
M00039584C:C11 ES 147
M00039587C:F12 ES 147
M00039590D:D02 ES 147
M00039591C:D06 ES 147
M00039595C:E05 ES 147
M00039597D:F04 ES 147
M00039600A:A11 ES 148
M00039604B:E05 ES 148
M00039604D:G03 ES 148
M00039606B:D08 ES 148
M00039607D:E08 ES 148
M00039608D:H01 ES 148
M00039609D:F07 ES 148
M00039624A:H09 ES 148
M00039624B:F12 ES 148
M00039625B:G08 ES 148
M00039626D:F04 ES 148
M00039629B:F01 ES 148
M00039629D:B04 ES 148
M00039630A:C08 ES 148
M00039630C:H04 ES 148
M00039641A:A05 ES 148
M00039641C:D07 ES 148
M00039642D:B12 ES 148
M00039642D:H09 ES 148
M00039643C:B04 ES 148
M00039645C:E01 ES 148
M00039647A:H11 ES 148


M00039 /3dU:CjU8 


ES 


148 


X JAAA1 AT /I AT™> .FI A 

M00039740B:F10 ES 148
M00039752B:G08 ES 148
M00039755A:B08 ES 148
M00039756B:H06 ES 148
M00039760B:B08 ES 148
M00040131B:D11 ES 148
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Clone Name 



[M00040131C:F03_ 

M00646i3rD:Gb8 

M00040133B:"b63 

MOb040136C:F08 

M00040138B:H03 

M00040141D:F05 

M00040143A.H05 



Tube 



M00040145D:D03 
M00040147D:H1 1 



jES 148 
ES148 
lES 148 



M00040160B:A10 



M00040162A:E01 
Moo040T69B:F08 



M00040173D:B05 
Mbob40174C:EYd 
M00040174D:G03 
M00040181B:H09 
M0004018iD:H10 



M00040182D:D06 
M06040183A:F07 
M00040184C:A11 
;M00b40191A:B09 

'M6timn\A:G\\' 

M00040222D:G02 
:M00040223A:C05 
[M00040226A:H10 
M00040230A:H02 
;M00040231B:C08 
M00b40232D:B07 
M00<W6233A:H02 



M00040233C:G05 
M06040252C:C06 
M00040253C:A05 
M00040254BrC10 



M00040256A^A06 
M00040257D:H10" 
Mb004b2'6bB:D62 
M00040260C:D04 



M00040265D:C08 
M0"0b40267A:Eb6 



ES 148 
ES 148 
ES 148 
ES 148 



ES 148 
ES 148 
lES 148" 
ES 148 
ES 148 
ES 148 
ES148 
ES 148 



ES 148 
lES 1 148 



ES 148 
JES 148 
HES 148 
ES 148 
ES 148 
ES 148 
ES 148 
ES 148 
ES 148 
ES 148 
ES 148 
!ES 148" 



ES 148 
EST48 
lES 148 
!ES 148 
jES 148 
'ES 148 
!ES 148" 



iES 148 



M00040261C:F01 ES 148
M00040262B:B06 ES 148
M00040264D:G05 ES 148
M00040265D:B07 ES 148



M00040267C:C04 



IES 148 
iES 148 
TES 1 48 



done fName 


Tube 


IV A AAA/1 AO *7 1 D ■ C 1 0 


ES 


1 A O 

148 


\ >iaaa/i aoo i r'.nno 


ES 


148 


K A AAA/1 AOT5 D.U11 

M0(X)40273B:H12 


ES 


148 


\Af\nf\A AT7/1 a .nm 


bS 


1 A O 

145 


1WTAAA/1A07 A A «14I I 

MUUU4U2 /4A.ti 1 i 


ETC 

bb 


145 


maaa^aooap.liac 


ES 


1 A O 

148 


\A AAA/1 AOQ 1 n.Drtl 


re 

ES 


1 A O 

148 


\ if AAA A AO OO A . A AT 

MU0U4U282A:AU3 


ES 


148 


M00U4O286L:CU2 


ES 


148 


X >l A A A A AO O TO » r> AA 

M0()04()287L:BU9 


ES 


1 /I o 

148 


* jf aaa/i ao 0*7 rvr*A*7 


ES 


148 


X /fry A A") A AO 

M00039746C:AUo 


ES 


1 A O 

148 


X jf AAA") A*7 /I z^r'.prtn 

:M0U039746L:(jU9 


ES 


148 


M00039746C:HU5 


ES 


148 


M00039746C:HU6 


ES 


148 


M00039746U:Di I 


ES 


^ AO 

148 


\Af\(\(VM\H A Q A .CI 1 

M0UU39/4oA.r J 1 


ES 


1 A O 

148 


;M00l)39748C:r 1 1 


ES 


148 


M00039749D:D05 ES 148
M00039761D:E10 ES 148
M00039762B:F07 ES 148
M00039764C:D07 ES 148
M00039766A:G07 ES 148


IV/fAAAOAOiC^rVlJA 1 


ES 


1 A C\ 

149 


1 IV 4AAAO A0£00 . A (\A 

;MUUU39 /o /d:AU4 


ES 


1 AC\ 

149 


;\/f AAAIAT^TP-ET 1 O 

MUUU39 / 0 /L,.b 1 2 


ES 


149 


\/l AAAO AOOA A 1 


ES 


1 A A 

149 


A A A A A 0 A *70 A O . C A /I 


ES 


1 A A 

149 


N^AAfil AQ/lOrVOA 1 

1 MUUU39942D.CU 1 


ES 


1 A A 

149 


i \ A A A A 1 ? AO A1 TJ .TT1 A 

iMUUU39943B:r 1U 


ES 


1 vl A 

149 


\ A A A AT AA/f CO. IT A A 

M00U3994:>C:r U9 


ES 


1 jIA 

149 


IV /tAAAl AA/t£O.T7AQ 

M UUU3 9 9 4 o o : r U o 


ES 


1 A A 

149 


\A AAA! Afl/1 7 A .F\A£ 


LS 


i /in 
149 


lV/TAAAIAA/l Hf^.nCll 

MUUU39V4 /CUU3 


ES 


149 


IV /f AAA1 CidA Q A .CAT. 

MUUU39945A:bU3 


cc 
bS 


1 AC\ 

149 


\A AAAlOQ/lfin-rvi 1 

;MUUU3W4oU.Ul I 


bb 


1 A A 

149 


^A/TAAAOAAC 1 A .EJA7 


bS 


1 >1A 

149 


AAAOAQC 1 Ti-R 1 0 


ho 


1 /I A 

14V 




ES 


1 /! A 

149 


: IV A A A AO A AC CO. OA /I 

I MUUU399;) jL:CU4 


ES 


149 


M00039957C:C09 ES 149
M00039957D:A12 ES 149
M00039958A:A08 ES 149
M00039958C:B09 ES 149


» jaaa^aoai . /~* i i 

M00040201C:G11 ES 149
M00040202A:F05 ES 149


M00040203A:HUo 


ES 


149 


M00040203B:AU5 


ES 


149 


X 4 A A A A AO A1 r\. U 1 1 

M00040203L):rll 1 


ES 


149 


*. AAA A A AO Ai£ A . A fi*7 

MUUU4UzUoA:AU/ 


ES 


149 


X 4 A A A A AO Ann T~\ A O 

M00040207B:D08 ES 149
M00040208A:C03 ES 149
M00040208B:A07 ES 149
M00040208D:G09 ES 149


ivaaaaaao 1 7rvnri7 
MUUU4UZ 1 /D.dU / 


ES 


149 


» # A A A jl A ^ 1 O /"^ AO 

M00040218C:C02 ES 149
M00040219B:D02 ES 149
M00040219D:E08 ES 149
M00040291D:C05 ES 149
M00040293D:G04 ES 149
M00040294D:D12 ES 149
M00040296D:E09 ES 149
M00040298B:G02 ES 149
M00040299B:F10 ES 149
M00040313C:D05 ES 149
M00040313D:E04 ES 149
M00040314D:H05 ES 149
M00040317A:H03 ES 149
M00040317D:F02 ES 149
M00040318A:B02 ES 149
M00040318C:H11 ES 149
M00040320D:F02 ES 149
M00040323B:C12 ES 149
M00040323C:G11 ES 149
M00040326A:F04 ES 149
M00040327B:G06 ES 149
M00040332D:B05 ES 149


X 4 AAA /1A111 1~\ .pAT 

M00040333D:uUj 


ES 


149 


X f AAA /| A1 O vH""\.OAO 

M00040334D:B02 ES 149
M00040334D:C07 ES 149


A 4 A A A /l A O J O I~> . P\ 1 O 

M0004U3 4z B : U 1 2 


ES 


149 


\ A AAA/1 A1 ^in. A AO 


ES 


149 


\ jf AAA /I A1 yl £ A ./"^ 1 1 

MUUU4U34oA:Ll 1 


to 






ES 


149 


M00040349D:B09 ES 149
M00040351B:F02 ES 149
M00040351D:A11 ES 149
M00040364A:E05 ES 149


M00040366A.B01 iES 149 


MUUU4UjOoA. A 1 Z 


CC 1 AO 


X jf AAA A AO A .TA1 

M00U4O368A:rOl 


t:c i aci 
hb 14V 


X jf AAA A AO /7 O 1~A . r* AA 




MUUU4UJ / lCrlUj 




X jf A A A /lAOTC/^.DAiT 


re HO 

bo 14y 


MUUU4UJ /ov^.uUZ 




x jiaaa/iao nr 1 . AT 

M00040377C:G07 ES 149
M00040383A:H02 ES 149
M00040383D:C04 ES 149
M00040385C:D02 ES 149
M00040386A:A02 ES 149
M00040387C:E07 ES 149
M00040387D:H05 ES 149
M00040390A:H02 ES 149
M00040390B:F02 ES 149
M00040391A:D10 ES 149
M00040392B:H01 ES 149
M00040392C:B12 ES 149
M00040394A:D04 ES 149
M00040395B:D11 ES 149


X/TAAA/IOCO/i A . A AC 

; M00U42534A: A\)5 


CC 1 AA 

bb 14V 


;M00042538B:E06 


T~ ' o 1 A A 

ES 149 


; X 4 AAA ylOC /OP.rA/l 

:M00042543C;G04 


CC 1 AA 

bb 149 


: X 4 AAA A O C C O A .TAI 

iM00042558A:r03 


CC 1 AO 

bb 149 


M00042560A:Fl2 


ES 149 


A 4 AAA /i^c/rr 1 . A AO 

M00042565L:A08 


CC 1 AA 

bb 149 


;X IAAA>l^f / /"/^ /~i A £" 

;M00042566C:C05 


bS 149 


x 4 AAA /I O .LJ 1 A 

M00042567B:H10 


CC 1 A A 

bb 14V 


X 4 A A A A O AO T\ T^A/I 

M00042693D:E04 


CC 1 AC\ 

bb 149 


x yf AAA/IOiTA^D .L7AC 


CO 1 AQ. 

bb 14V 


A jf AAA/1 OiCmr^./^AT 


CC 1 CA 

bb IjU 


i X A A A A A 1/"AOr\.A 1 A 

;M00042698D:D10 


CC 1 CA 

bb 150 


; a a aaa /i i^non.c A 1 
|MU0U42o98U:bUl 


CC 1 CA 

bb IjU 


M00042702B:G02 ES 150


^ A/f AAA/f T7A/( A -17AQ 


bb IjU 


M00042711B:A11 ES 150


IVlUUUnZ / 1 //\.\w/U/ 


F<s 1 SO 


M00042737C:H04 


ES 150 


M00042740A:E09 


ES 150 


M06042742D:b05 


ES 150 


M00042887C:D07


ES 150


M00042895A:D10


ES 150


M00042895C:G01


ES 150 




PP01598.002 / 200130.512 



Clone Name 


Tube 


M00042902D:B08 ES 150
M00042904B:E07 ES 150
M00042905A:F11 ES 150
M00042905B:C03 ES 150
M00042905D:D02 ES 150
M00042347D:H11 ES 150
M00042348B:E05 ES 150
M00042349D:D07 ES 150
M00042431B:G08 ES 150
M00042431C:F0f ES 150
M00042431D:C10 ES 150
M00042431D:E02 ES 150
M00042435A:Air ES 150
M00042436B:H09 ES 150
M00042437A:D04 ES 150
M00042439B:B03 ES 150
M00042439B:D03 ES 150
M00042440B:E09 ES 150
M00042463A:F09 ES 150
M00042470C:E05 ES 150
M00042511A:H04 ES 150
M00042515C:F08 ES 150
M00042751C:C12 ES 150
M00042752A:E11 ES 150
M00042756B:FlV ES 150
M00042756D:A10 ES 150
M00042759B:G11 ES 150
M00042760A:C12 ES 150
M00042765C:b04 ES 150
M00042767B:G10 ES 150
M00042769C:E09 ES 150
M00042770B:B12 ES 150



Clone Name


Tube


M00042770C:C04 ES 150
M00042771C:F06 ES 150
M00042774C:C05 ES 150
M0004278rA:A07 ES 150
M00042784A:H06 ES 150
M00042788C:F11 ES 150
M00042790C:C07 ES 150
M00042792A:H01 ES 150
M00042797D:D10 ES 150
M00042799b:F08 ES 150
M00042800A:A03 ES 150
M00042802C:C04 ES 150



M00042806C:F07 ES 150
M00042807D:b05 ES 150
M00042823C:C02 ES 150
M00042830B:E02 ES 150
M00042839B:B11 ES 150
M00042841D:H07 ES 150
M00042849D:F11 ES 150
M00042852B:A03 ES 150
M00042852C:A01 ES 150
M00042856B:H02 ES 150
M00042352C:H03 ES 150
M00042352D:C0r ES 150
M00042352D:G09 ES 150
M00042448A:C09 ES 150
M00042448C:H12 ES 150
M00042453B:G09 ES 150
M00042518D:A06 ES 150
M00042518D:D04 ES 150
M00043296B:G09 ES 150
M00043304B:D05 ES 150
M00043304C:b02 ES 150
M00043305B:G02 ES 150
M00043306C:B03 ES 150
M00043306D:B07 ES 150
M00043310C:G06 ES 150
M00043311C:E03 ES 150
M00043312C:E08 ES 150
M00043320B:A07 ES 150
M00043324D:H11 ES 150



M00043328D:H02 ES 150
M00043l32C:G04 ES 150
M00043334B:A10 ES 150
M00043338B:A03 ES 150
M00043338B:C11 ES 150
M00043339A:F11 ES 150
M00043340B:H08 ES 150



M00043344D:E04 


ES 


150


M00043345C:A06


ES


150


M00043346A:G01 


ES 


150


[Mbbb43350biin 


ES 


151


M00043351D:A11


ES


151


M00043352D:B05 


ES 


151 


M00043352D:C03


ES 


151 


M00043359B:D10


ES


151


Clone Name


Tube 


M00043359C:G01 ES 151
M00043361B:A01 ES 151
M00043366A:A02 ES 151
M00043366C:H05 ES 151
M00043367B:A08 ES 151
M00043368C:F09 ES 151
M00043370B:C08 ES 151
M00043372C:G05 ES 151
M00043377A:C03 ES 151
M00043378A:H10 ES 151
M00043379D:H02 ES 151
M00043383C:F12 ES 151
M00043383D:A02 ES 151
M00043384B:B02 ES 151
M00043386A:B08 ES 151
M00043389C:E03 ES 151
M00043389D:D07 ES 151
M00043391A:C10 ES 151
M00043391A:G08 ES 151
M00043392D:C11 ES 151
M00043393A:B08 ES 151
M00043401D:G08 ES 151
M00043402C:D08 ES 151
M00043405A:D11 ES 151
M00043405C:G12 ES 151
M00043405C:G02 ES 151
M00043406B:G12 ES 151
M00043407C:E05 ES 151
M00043408B:D11 ES 151
M00043409B:B03 ES 151
M00043410C:A09 ES 151
M00043411B:D08 ES 151
M00043411D:H06 ES 151
M00042584B:C10 ES 151
M00042623D:D07 ES 151
M00042625C:B04 ES 151
M00042626B:D08 ES 151
M00042627C:D01 ES 151


M0004263UA:LU5 


CO 


1 C 1 
1 D 1 


l vi \j \j u h z y j j . yj \j j 


ES 


151 




ES 


151


M00042T60D-H08 


ES 


151 


M00042962D:C05 


ES 


151


M00042964D:A03 


ES 


151 



Clone Name


Tube


MUUU42yoob.rU/ 


PTC 1 < 1 

bb I j I 


JVlUUU42yooCbUo 


EC 1 C 1 

bo lji 


\yf AAA/f OA7Ai°. A CIA 

MUUU4zy /UL. AU4 


bb I j I 


lYlUUU4zy /U^.li 1 U 


bo lji 


IvlUUU4Zy /OA.HU4 


bo I !) I 


\A AAA/1 0O7OQ ■ C AO 

MUUU4zy /yt). buz 


bo ID 1 


\/f AAA/1 7AC 1 D. P» t 7 
MUUU4ZyiSI D.1J1 1 


bo I j I 


\A AAA/1 OGQir"' A 1 1 

MUUU4zyo3L. A 1 1 


bo JjI 


\/f AAA/f OOC^i^./^A/i 

MUUU4zy oiL .OUo 


ETC t < 1 

bo 1 j 1 


\a aaa/1 onOiCr^.r 1 1 1 
MUUU4zy5oL.\j 1 1 


nc ki 
bo 1M 


\/f aaa/1 moo a .van 
MUUU4zyo5A.rUo 


bo i j 1 


\4AAA/nAA7D .F\A^ 

MUUU4zyy /r3:UUo 


CC 1^1 
bo 1-5 1 


\A AAA/1 7AA<? A .Crtl 

MUUU4zyyoA:hUi 


CC 1^1 

bo ID 1 


X/fAAA/innAO A ./^A/1 

MUUU42yy5A:uU4 


CC 1^1 

bo 1M 


\Af\f\f\A 1 A A 1D.U1 A 

MUUU43UU I r3:ri i U 


bo 1M 


\ A A A A A 1 A A 1 F\ • n A 7 
MUUU4JUU 1 u.uyjD 


CC 1 CI 
DO 1 J I 


\A AAA/1 TAAO A .CAC 

MUUU43 UUz A : cUj 


CC 1 CI 

bo I !> I 


M00043003C:D08 


ES 151


MUUU43UI I A:H12 


CC 1 CI 

bS 1M 


h AAA A /1TA1 C A .Ul A 

MU0U43U1dA:H1U 


CC 1 CI 

bS Id I 


IV >f AAA A O AO OA .CIO 

MuUU430zzA:blz 


C C 1 C 1 

bo 1 3 I 


A A AAA/1 "5 AOiC/^ 1 ,HAO 

MUUU43U2oC:L>U/ 

IVylAAAylO AOO A .PAC 

MU00430zoA:(jU5 


CC KI 

bo 1 J I 


CC K 1 

bo I j 1 


\if AAA/IO AO C\(~* . A AiC 

MUUU43UzyL:AUo 


CC K 1 

bo IM 


MUUU4jUjZCA1U 


CC 1 C 1 
CO I J 1 


\ yfAAA/1 1 AO A X~~\.(~^f\ 1 

MUUU43U34U.LU1 


rc KI 

bb 151 


\ /f AAA/1 O AT /CP.CAC 

MUOU43U3oC:bO-> 


C C K 1 

bo 151 


N 4 A A A /1 1 AT £ ry P AQ 
IVlUUU4JUj0lJ.L^Uy 


CC K 1 

bb l j 1 


\ yfAAA/1 T A A AD . OAT 

MUUU43040B:BU7 


C C 1 C 1 

bo 151 


K A AAA/I OA/1/1D. A lO 

MUUU43U44B: A 1 z 


CC K 1 

bo 1 5 I 


\ A* AAA A O A A AT\. A A A 

MUU043044L>:AU9 


C C K 1 ■ 

bb 151 


AAAAA/IO A/1 CT^.P 1 O 

MUUU43U4jD:01z 


CC KI 

bb 15 1 


A/iAAA/11A/iAr\-m 1 


bo ID l 


\ A AAA/10 A^AA./ 0 1 O 

MUUU43UoUL).U 1 z 


CC Kl 

bb 15 i 


\ AAA A/11 A/CiCD -U 1 1 

MUUU43Uoori.Hl 1 


CC KI ^ 

bb 151 


N/fAAA/10 A£7TVr.1 A 

MUUU43 Uo /U.U 1 U 


CC I C i ~ 

bb 151 


MUUU4J 1 Z )A.D 1 1 


CO 1 J l 


K/fAAA/ll 1 1<r- A t 1 

IV1UUU43 1 ZDL.A 1 1 


CC K 1 

bb 151 


MAAA40^1 1 A 'AHA 


CC 1C1 

no i j i 


IVIUUUH-ZO 1 1 L/.D I Z 


F<s 1S1 

Do Ul 


M00042612D^ 


ES 151 


M00042614B:B05 


ES 151 


M00043073A:C12


ES 151


M00043078D:D04 


ES 151


Clone Name


Tube 


* H (\f\f\ A T AO 1 l~\ „ r A C 

M00043081D:r05 


re 1 CI 

bo 151 


M00043087B:G07 


ES 151


M00043093C:G11


ES 151


R/fAAA/t 7 noc a -irrio 
MUUU4 J \)y D A . r \)y 


c c i co 

Cj 1 ->Z 


M00043096A:<jiO4 


trc i co 
bo 15z 


A jf A A A A O 1 AO A . CA£ 

M00043108A:r06 


rc i CO 
bo 15Z 


hjiaaa/io. i nnr'.r'fli 
M0UU431U9L.UU1 


pc 1 CO 
OZ 


A jf AAA /) 1 1 1 1 n. A Aft 

M00043131B:A09 


ES 152


A 4AA A/1 T 1 1 


PQ 1 C7 
CO lJZ 


M00043138D:B11 ES 152
M00043143B:A10 ES 152
M00043148C:A09 ES 152
M00043154A:B07 ES 152
M00043162A:B08 ES 152
M00043162D:C12 ES 152
M00043164C:E12 ES 152


» * AAA A ~> 1 /^f n.PAl 

M00043 165B.UU1 


CC 1 CO 

bo 15z 


M00043173D:G03


ES 152


A >f A AA A T 1 O /I A .UAO 


CC 1 co 
bo I5Z 


M00043187A:C04 ES 152
M00043191A:A07 ES 152
M00043192C:B12 ES 152
M00043200A:H09 ES 152
M00043200B:C08 ES 152
M00043202B:F01 ES 152
M00043203A:B09 ES 152
M00043210C:E05 ES 152
M00043211A:F01 ES 152
M00043213B:B12 ES 152
M00043215A:D02 ES 152
M00043220B:C04 ES 152
M00042591D:H03 ES 152
M00042592A:H10 ES 152


M(J0042593A:LUZ 


bo 15Z 


\ lAAAHOf AOP.r'Ai 

M00042593C:O06 


CC 1 CO 

bo 15z 


M00042595A:A11


ES 152


A JAAA J O C AC A .DAI 

Mu(K)42595 A:bu 1 


ETC 1^0 
bo 15Z 


■\ a AAA/iocn^D-cn/c 
MUUl)4259ob:rUo 


E,0 IJZ 


a jfAAAx^r n^/^.r\Ai 

M00042596C:DO/ 


CC 1 CO 

bo 15z 


M00042597B:E12


ES 152 


M00043416C:A02 


ES 152 


M00043417C:D05 


ES 152 


M00043418A:H10 


ES 152 


M00043419D:A10


ES 152





Tube 


\a fi(\f\A i a o s n • no, r 

M UUU'f J HZ 0 Lf . VJU 0 


ES 


1 CO 

152 


MUUU4j £ tJUD.^UZ 


ES 


1 CO 

152 


iVlVVvH JHJ l U. DUO 


CC 

bo 


1 CO 

15z 


lVluUut J H J J D .\JU 7 


ES 


1 CO 

152 




ES 


1 CO 

152 




ES 


1 CO 

152 


\A n n i ft A \ A A ft C • R ft 7 


ES 


1 CO 

152 


A>f ftftft/1 1/1/1^1^ ■ P 1 7 


rc 

ES 


1 CO 

152 


\/f ft ft ftzH zt A 7 A * P ft 7 


ES 


152 


A/fftftft/11A/10 A 1 7 


ES 


152 


A 4ft n ft /17 /l ^ft P * P ft ^ 


ES 


152 


\A(\C\(\A 7 /l CI R • RftQ 


ES 


152 


"\4ftnft/l'J/lCfi A -R17 
MUUU434joA.o 1Z 


ES 


152 


N/fftnft/i'j/i^ i n*pft7 


ES 


152 


\4ftftftA1AAl n-Pft^x 
MUUU4 j40 1 JJ.CiUO 


ES 


152 


N/lftftftA1AACR-Uft7 
MUUU4j40Jt>.nUZ 


ES 


152 


\Af\C\f\A1A ACP- A ft^ 


ES 


152 


\ yf nnn/i 7 a a c ■ p fto 


ES 


152 


N>fnnn/i7/i7^ a -Pft7 
MUUU4j4 /OA.rU / 


ES 


152 


IVlUUll434oJD.u 1U 


ES 


152 


A4AAA/11/10,1 r^'PA/l 

JVlUUU434y l^.rU4 


ES 


152 


A4AAi^/t 7/1Q0 A -PHI 


ES 


152 


\Af\(\c\A i c i 7r»*r;ftff 
MUUU4Jj 1 JU.OUo 


ES 


152 


ivlUUU4j J 1 DD.rlWV 


ES 


1 CO 

152 


\AftftftAlC 1 SR-nftA 


ES 


152 


\yiriMAA7C7AR-ni ft 
JVJUUU43 jZOd.U 1 U 


rc 

ES 


1 CO 

152 


iviuuw 4 tj jz /v^.cuy 


r o 

ES 


1 c o 

152 


\Aft ft ftk 7 C 7 8P ■ A ft7 
M U U U 4 j Z> Z o . /\ U Z 


r c 

ES 


1 CO 

152 


K4ftftftA7A1 ^R-Pft7 


r c 

ES 


1 CO 

152 


N/tftftftAIAl AP- Aft^ 


r c 

ES 


1 CO 

152 


MftftftA7A77n-PftO 


ES 


152 


A^ftftftA 7A7A A -P 1 ft 


CC 

bo 


1 CO 

152 


Mftftft/l^fi^^r-Pl 1 


r c 

bo 


1 CO 

152 


MftftftA^rV^R-Pftfi 


ES 


1 CO 

152 


\AftftftA7A77P-Plftl 


ES 


1 CO 

152 


\AftftftA7 ^7 R A • HftA 
1V1UUU4 J OJ 0 A . LfyJO 


CC 

bo 


1 CO 

152 


_ 

IVIV/ \J\JH J O^tV/V^ . jGU J 


r c 

ES 


1 CO 

152 


\AftftftA7 AAft A -Pft7 


r c 

ES 


1 CO 

152 


\AftftftA7£AQR'Fft7 


ES 


152 


M00001338C:B02 ES 153
M00001338C:F05 ES 153
M00001338D:D01 ES 153
M00001340D:F07 ES 153




ES 


153 







Clone Name


Tube


M00001346B:G11 ES 153
M00001348B:B03 ES 153
M00001349C:B04 ES 153
M00001351B:E11 ES 153
M00001352B:B02 ES 153
M00001353A:H07 ES 153
M00001353C:A05 ES 153
M00001353D:E05 ES 153
M00001356D:E06 ES 153
M00001358A:E08 ES 153
M00001359A:H10 ES 153
M00001361A:C12 ES 153
M00001361B:A12 ES 153
M00001362A:F09 ES 153
M00001364A:C09 ES 153
M00001364C:H10 ES 153
M00001368A:A08 ES 153
M00001368A:B07 ES 153
M00001368A:C02 ES 153
M00001369A:G06 ES 153
M00001374A:B02 ES 153
M00001374C:B10 ES 153
M00001375B:D04 ES 153
M00001378C:E10 ES 153
M00001379A:F09 ES 153
M00001382D:A07 ES 153
M00001382D:H08 ES 153
M00001384A:A07 ES 153
M00001385A:E07 ES 153
M00001386B:FfT ES 153
M00001387A:C12 ES 153
M00001387B:A11 ES 153
M00001389B:E10 ES 153
M00001389D:D06 ES 153
M00001390D:E02 ES 153
M00001391D:D03 ES 153
M00001393B:C03 ES 153
M00001393C:E08 ES 153
M00001393C:F04 ES 153
M00001393D:EW ES 153
M00001396B:B0f ES 153
M00001396B:B12 ES 153
M00001396D:H02 ES 153
M00001397C:H08 ES 153



Clone Name 



Tube 



M00001399B:B01 ES 153
M00001399C:A01 ES 153
M00001403C:B03 ES 153
M00001403D:C12 ES 153
M00001406B:H09 ES 153
M00001406D:F06 ES 153
M00001410A:G10 ES 153
M00001416B:A05 ES 153



M00001421B:E07
M00001422B:D06
M00001424B:H06
M00001426C:F06
M00001428B:C10
M00001429B:G05
M00001430B:C01
M00001433B:E02
M00001442A:F08
M00001442C:G12
M00001444B:E04
M00001444C:D11
M00001445B:F06
M00001449B:H10
M00001451C:E10
M00001460C:E10
M00001461D:B10
M00001461D:C10
M00001465C:A02



M00001466B:F03



M00001467C:D04 



ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153
ES 153



M00001477D:G09 ES 153
M00001485C:F06
M00001488C:A03



ES 153 



ES 153 



M00001497C:F10 ES 153

M00001503B:H10 ES 153
M0600ll0jB:bir : ES'T53" 
M00001512D:F08 ES 153



M00001518B:D10 ES 153 



M00001528C:C03 



M00001532A:G08 
M060^533CTGir 



ES 153 



ES 153 

ES 153



M00001533D:A01 



M00001534C:E07 



ES 153 



ES 153 



M00001535B:B10 ES 153



Clone Name


Tube 


M00001535B:E02


ES 153 


M00001537B:H10 ES 153


M00001538B:A07 ES 153


M00001539C:F12 


ES 154 


M00001542B:F09 


ES 154


M00001543C:A08 


ES 154 


M00001544B:B05 


ES 154 


M00001544B:E06 


ES 154 


M00001546B:C11 


ES 154 




ES 154 


M00001550A:H06 


ES 154 


M00001550D:B11


ES 154 


M00001551D:D01 


ES 154 


M00001551D:H09


ES 154 


M00001554C:G10 


ES 154 


M00001558A:E06


ES 154 


M00001559A:H09 


ES 154 


M00001561D:H04 


ES 154 


M00001562B:B02 


ES 154 


M00001562D:B07 


ES 154 


M00001565A:H05


ES 154


M00001568C:A03


ES 154


M00001570A:B07


ES 154


M00001591B:H05


ES 154 


M00001596A:D02 


ES 154 


M00001600B:G01


ES 154


M00001605B:B05


ES 154


M00001606B:A10


ES 154


M00001606D:D06 


ES 154 


M00001607A:E04 


ES 154 


M00001607D:H09 


ES 154 


M00001609D:C11 


ES 154 


M00001616D:F03 


ES 154 


M00001617C:F10 


ES 154 


M00001618C:D01 


ES 154 


M00001619C:H09 


ES 154 


M00001620B:A03 


ES 154 


M00001623D:A10 


ES 154 


M00001623D:E12 


ES 154 


M00001624A:C01 


ES 154 


M00001625D:B04 


ES 154 


M00001626A:D07 


ES 154


M00001632C:A10 


ES 154 


M00001633D:C11 


ES 154 



Clone Name 


Tube 


M00001637D:C12 ES 154
M00001648A:D10 ES 154
M00001661D:F06 ES 154
M00001663A:A12 ES 154
M00001671A:H10 ES 154
M00001671C:F03 ES 154
M00001675B:D06 ES 154
M00001677B:H08 ES 154
M00001680A:A01 ES 154
M00001683B:F11 ES 154
M00001684D:E04 ES 154
M00001686B:H01 ES 154
M00001686D:F06 ES 154
M00001688B:B11 ES 154
M00001692C:C04 ES 154
M00001771B:E06 ES 154
M00003746C:E11 ES 154
M00003749C:C08 ES 154
M00003753A:C11 ES 154
M00003758B:D07 ES 154
M00003758B:F06 ES 154
M00003760C:Gr0 ES 154
M00003761B:B02 ES 154
M00003763A:B02 ES 154
M00003763B:B10 ES 154
M00003764A:H09 ES 154
M00003764B:F11 ES 154
M00003764B:H11 ES 154
M00003764D:F07 ES 154
M00003768D:D08 ES 154
M00003770C:A10 ES 154
M00003771D:A03 ES 154
M00003773A:F10 ES 154
M00003780A:G01 ES 154
M00003782A:B02 ES 154
M00003785D:F07 ES 154
M00003787D:A10 ES 154
M00003808A:F11 ES 154
M00003808B:E07 ES 154
M00003812C:A03 ES 154
M00003814A:G05 ES 154
M00003819B:B01 ES 154
M00003820B:F11 ES 154
M00003821C:E12 ES 154







Clone Name




\yfAAnni277P- a aq 

1V1UUUUjoZZ*^./\U:7 


FS 154 
co i jh 


MUUUUjoZZIJ.AUZ 


fs i 54 

Co I 


Monoo^jmR* AOfi 

1V1UUUU J O^J D.AUU 


FS 154 


IVIUUUUjoZ JrY.ri 1 u 


FS 154 

Co 1 Jt 


M00001 82 8 A- IT )] Tl 


ES 154 


MftnnniRi or rofi 

1V1UI/UUJ OJUlJ.\^l/U 


FS 154 


M 00001 XI or - DO?" 


ES 154 


MOoooiJtt7r-ni 6 


FS 154 


MOOOlfn R1QP • HID 


FS 154 

l^O 1 JT 


\4AAAni°.47n-ni i 

1V1UUUUJ04Z1J.JJ 1 1 1 


FS 154 


imaaaai 847 rv woo 
MUUUUjo4zij\riuy 


FS 154 

CO 1 J4 


MAAAniSA^ A -PA7 
MUUUUj54 jA.LU / 


F<s 1 SS 

Co l Jj 




FS 1 55 

Co 1 J J 


\4AAAA7 8.47 A * 14A4 
1V1UUUUjo4 / A.nUH 


FS 155 


lviUUUUjo4o*^.VJU:7 


FS 155 

DO IJJ 


iViUUUUjoj lo.AUi 


CO I Jj 


TV4AAAnis^4R-FA7 
IVlUUUUJoj4D.rU/ 


f<; i ^s 

Co I jj 


ft ^ <.p ■ FA7 

MUUUUjoj JL.rUZ 


F<s 1 SS 
Co 1 JJ 


lV/f Afiftfn fi£4 A • F 1 7 
MUUUUjoo4A.C 1Z 


Co 1 J J 


\4 aaaa^ q°.7P'FAQ 
MUUUUjoo /txuy 


FQ 1 S<\ 
Co 1 jj 


MUUUUjoooD.rUy 


Co 1 JJ 


ivytAAAmcGi n-uno 
MUUUUjoV 1 o.rlUZ 


Co I jj 


\a AAAA^QGftp* Am 


CC 1 CC 
CO 1 JJ 


' x/iAAAAiQAAP-m 7 

; IVIUUUUjVUUU.U 1 z 


Co 1 JJ 


AAAfnoA^ a -p A7 
* iviuuuujyuo/\.v^uz 


pc 1 S5 

CO 1 J J 


JV/fAAAA'SQJ IP* A HO 

Muuuujyt iL^.Auy 


FQ 1 SS 
Co I jj 


Monno^oi 4 A * A OR 


FS 1 55 

CO 1 J J 


maaaaigi ^p-m n 

MUUUUjV 1 jU.IJIU 


Co I jj 


maaaaiqi ^p-par 

IVlUUUUjy l jV^.vJUo 


FS 155 

Co l JJ 


i M AAAA7Q 1 £ A *F A4 


FS 155 

CO 1 JJ 


lvluUUUj"ZO/\.r 1 1 


CO 1 J J 


1V/IAAAA'*Q'*<\R'RA1 
MUUUUJV-) JD.OU 1 


FS 1 55 

CO 1 JJ 


MAA00101RP-A0S 


FS 155 

CO i J J 


MAA001Q4? AD01 


FS 155 

L/O 1 J J 


M00001Q5RPHOR 


FS 155 

i^O I J J 


M000010S0D' AOS 


FS 155 

JuO 1 J J 


M00001Q60DC12 


Jakj 1 J J 


M000010^n*F0 1 


FS 155 

L»0 1 J J 


M00001Q65DD1 1 


ES 155 


MUUUUJyooL.LrUJ 


re 1 cc 
Co Jjj 


M00003970D:H07 


ES 155 


M00003972C:F07 


ES 155 


M00003974C:E11


ES 155 


M00003974D:E02 


ES 155 



Clone Name 


Tube 


M00003979B:A04 


ES 155 


M00003980D:C06 


ES 155


M00003985D:B02 ES 155


M00003988D:B01 ES 155


M00003991A:C11


ES 155 


M00003993C:D07 


ES 155 


M00003993D:B03 


ES 155 


M00003994A:B10 


ES 155


M00003996B.U07 ES 155 


M00003998B:G10


ES 155


M00004028B:F10 


ES 155 


M00004029D:A01 


ES 155 


M00004031C:G06 


ES 155 


M00004036B:A11 ES 155
M00004036D:C12 ES 155
M00004038A:A04 ES 155
M00004042B:A11 ES 155
M00004047C:B09 ES 155
M00004047D:F12 ES 155
M00004053D:F09 ES 155
M00004054A:D03 ES 155


h 4AAAA A c\c c . n 1 A 


cc tec 
Co 15 j 


M00004055D:D05


ES 155


\ A A AAA A AC 1 T~\ . A 1 


CC 1 c c 

bo 155 


H 4AAAA/1 A/C 1 D.CAC 


CC 1 cc 

bo 155 


\ jf AAAAyl A/TTn. A AO 

M00004UozlJ:AUz 


CC 1 cc 
to Oj 


MUUUU4UoolJ.Vj; 1 U 


CC ICC 

Co Ijj 


MUUUU4UD /d:L)U3 


CC 1 cc 
Co Ijj 


MUUUU4UoUCCU4 


CC ■ 
Co Ijj 


M00004085A:H01 ES 155
M00004085B:H02 ES 155
M00004087C:E02 ES 155
M00004093A:C03 ES 155
M00004096D:F02 ES 155
M00004102A:E03 ES 155
M00004103C:E10 ES 155
M00004104A:A12 ES 155
M00004110D:F09 ES 155
M00004114C:D11 ES 155
M00004115A:G12 ES 155
M00004118C:D12 ES 155
M00004122C:D01 ES 155
M00004134A:A08 ES 155
M00004136C:B12 ES 155


Clone Name


Tube 


M00004139B:rUJ 


1 bo 1 jj 


AjfAAAA/1 1 /I 1 A .T"\A1 

M0U0U4 1 4 1 A:Di) 1 


I bo 1 j j 


MUI/UU4 1 4 1 t> . DU I 


i Cj l J J 


MUUUU4 i41r>.rUo 


} bo I JJ 


\A(\(\C\(\A 1 zHR-Rfizl 


; CO 1 JJ 


MUUUU4 i^hU.dvZ 


Tec K< 

; Co 1 JJ 




i CO 1 JJ 


MUUUU4 1 4013. c.Uo 


i lZO I JJ 


IVlUUUU't 1 tDV^.DUH 


] L/O 1 J J 


M00004147C:E01 ES 155
M00004151B:A07 ES 155
M00004155A:H03 ES 155
M00004155C:A10 ES 155
M00004158B:E03 ES 155
M00004158D:E08 ES 155
M00004159C:D10 ES 155
M00004159D:F12 ES 155
M00004160D:F06 ES 155
M00004160b:G05 ES 155



M00004162D:F02 ES 156
M00004163B:C03 ES 156
M00004163C:A0f ES 156
M00004164B:E12 ES 156
M00004165C:A11 ES 156
M00004166C:B10 ES 156
M00004169A:E04 ES 156
M00004170A:F03 ES 156
M00004171B:B03 ES 156
M00004172C:A08 ES 156
M00004172D:B12 ES 156
M00004172D:F04 ES 156
M00004175D:E06 ES 156
M00004176C:A09 ES 156
M00004179C:B06 ES 156
M00004179D:A12 ES 156
M00004187B:C02 ES 156
M00004189A:C12 ES 156



M00004192C:B06 



M00004195A:F07 
M00004201Dic6r 



M00004201D:C03 



M00004204C:H08 
M00004207CiAM~ 



ES 156 



ES 156 
ES 156 
ES 156



ES 156
ES 156
ES 156



Clone Name 



Tube 



M00004208A:D08 ES 156
M00004210A:A03 ES 156
M00004212D:C03 ES 156
M00004214A:E05 ES 156
M00004214D:A05 ES 156
M00004215B:C05 ES 156
M00004220D:C11 ES 156
M00004225D:E03 ES 156
M00004229B:B06 ES 156
M00004230D:B05 ES 156
M00004237C:D10 ES 156
M00004242D:H01 ES 156
M00004245C:G10 ES 156



M00004246B:H07 ES 156
M00004251D:D03 ES 156
M00004263C:D03 ES 156
M00004266B:F07 ES 156
M00004269A:F11 ES 156
M00004269A:G11 ES 156
M00004269B:B04 ES 156
M00004270A:E09 ES 156
MOOOO^ ES 156
M00004277D:B02 ES 156
M00004278A:G06 ES 156
M00004278C:B10 ES 156
M00004281A:C04 ES 156
M00004282A:D01 ES 156
M00004282B:D07 ES 156
M00004282C:A12 ES 156
M00004284A:F08 ES 156
M00004295D:C07 ES 156
M00004296B:D03 ES 156
M00004303C:C05 ES 156
M00004310B:E02 ES 156
M00004316A:B03 ES 156
M00004320C:E07 ES 156
M00004321C:C11 ES 156
M00004322B:D03 ES 156
M00004324A:B03 ES 156
M00004324A:D10 ES 156
M00004324A:D05 ES 156
M00004328A:D01 ES 156
M00004330A:A01 ES 156
M00004336A:A01 ES 156


Clone Name


Tube


M00004341C:A09


ES 156


M00004341C:E05 


ES 156 


A Af\(\(\f\A 1 A A A 1 1 

M00uU4344A:(jI 1 

_ _ \ 


CO IJO 


\ A (\(\(\(\A 1 A A T\-C* 1 7 

MUUUU4 J44D.t 1 Z 


1 


X A AA A A A 1 /I 7 D . C ft A 

MU000434 /hs:bU4 


cc i c/; 
bo Ijd 


X A AAAA/17 A 7P. A A£ 


ire i 


X >f AAAA/17 C A A . A f\A 

MU0U04350A:AU4 


bo IjO 


M00004351B:G07 ES 156
M00004352A:D08 ES 156
M00004357B:B06 ES 156
M00004358B:G02 ES 156
M00004359A:E01 ES 156
M00004360C:D09 ES 156
M00004365C:C09 ES 156
M00004365C:G11 ES 156
M00004366D:C11 ES 156
M00004368A:B11 ES 156
M00004372A:E12 ES 156
M00004376b:A12 ES 156
M00004385C:H12 ES 156
M00004393C:D06 ES 156
M00004406A:G09 ES 156
M00004416B:G10 ES 156
M00004418B:A11 ES 156
M00004419A:G02 ES 156
M00004420D:E05 ES 156
M00004430A:A05 ES 156
M00004430B:B10 ES 157
M00004443C:F07 ES 157
M00004462D:D12 ES 157
M00004502A:D12 ES 157
M00004507D:E03 ES 157
M00004509B:B10 ES 157
M00004509D:C06 ES 157
M00004603B:E02 ES 157
M00004603C:C10 ES 157


X if A A A A A S (\/lT~\ . I FAA 

Mu0004606U:HOy 


bo 15 / 


XjTAAAA/t/TAO A .PI A 

M000046U8 A:L 1 U 


CC 1 ^"7 

bo 15 / 


A4AAAA/1^A0 A >LJA/1 


CC 1 ^7 

bo l j / 


M00004609A:E09


ES 157 


M00023389A:G04 


ES 157 


M00023394D:D10 


ES 157 


M00026809A:H08 


ES 157 


M00026818C:E01 


ES 157 



Clone Name


Tube


MUUUZOoJDD.rlUJ 


P*s 1 S7 


MUUUZ0o4ZD.AU 1 


LJ ID t 


Jvl W \)L O Z V . L« l/Z 


DO 1 J / 




p<; i S7 

CO I -> / 


IVlUUUZOo jOD.Uuj 


CO I J / 


MUUUZoyuuA.nu / 


CC K7 
CO 1 J / 


Mnnn?^oo7nF07 


FS 157 


muuuz oy i uo.ouo 


i S7 

CO u/ 


MUUUZOy 1 ^fCriU!7 


CO 1 J / 


MUUUzoyjuU.LU / 


Co 1 j / 


MUUUZOyO 1 A.E5UO 


. 1 

CO 1 J f 




cc 1 <7 


: IViUUUZ / UUH^.Vw I I 


cc 1 S7 

CO I J f 


; \Af\(\(\l 1(\ 1 7 A • RHQ 

; MUUUZ /U I / A.tsuy 


P<2 1 ^7 
Co 1 J / 


i IVIUUUZ /UjOA.dUO 


! 

CO 1 J / 


i IVlWvUZ / \J J v/\. DU^> 


FS 1 S7 


; MUUUZ /UjZA.JDI u 


pc 1 <7 

CO 1 J / 


MUUUZ / Uj fKs.UlK) 


FC. 1 *\7 
Co I j / 


MUUUZ /U04d.UUD 


cc 1 C7 1 
Co 1 J / 


\yinnn77nc 1 a - a 
MUUUZ /Uo 1 A. AUo 


CO 13/ 


i MUUUZ /Uy_)A.rlUZ 


Ipq 1 ^7 i 

CO 1 J / 


: mwvZ / 1 j 1 A.oU-) 


CQ 1 ^7 
CO 0 / 


j \a fiftfio*? i ^Qp-rn7 
i mvUVZ ( ljyL/.rU / 


CC 1 C7 1 

Co 1 j 1 


MUUUZ / 1 D / . D 1 U 


cc t ^7 

CO 1 J / 


i MUUUZ / IO0D.1IU6 


cc 1 <7 : 

CO 13/ 


MUUUZ / 1 /UU.l/U / 


F's 1S7 

CO 1 J 1 


1 MUUUZ / J / JtXl 1 


cc 1 S7 

CO I J / 


; MUUUZ / 1 / /D.UUt 




CO 1 J / 


\/fnn0771 7RR- A 1 1 
MUUUZ III oD. A 1 1 


F<i 1 S7 

CO 1 J / 


Mrtnn77i 8?R-nnn 

MUUUZ / l OZD.VJUO 


FS 1S7 

CO 1 J / 


= MUUUZ / 1 oy^,.D J U 


cc 1 C7 
Co 13/ 




FS 1 S7 

CO 1 J 1 


MUUUZ / Z 1 JA.ruO 


FS 1S7 


i MUUUZ /Z 1 JD.D I ^ 


FS 1S7 

CO l J 1 


i MUUUZ /^ttV^.DvU 


FS 1S7 


i MUUUZ /Z'f /t, JJUZ 


CO 1 3 / 


. A f)7 


FS 1 57 




FS 157 


f MUUUZ / Z /HA.nU" 


FS 157 


I muuuz /zyut^.ruo 


FS 1 ^7 

CO 13/ 


M00027291A:G08 


ES 157 


M00027311A:H09 


ES 157 


M00027313C:E01 


ES 157 


M00027314D:E02


ES 157


Clone Name


Tube 


M00027316C:C03 ES 157
M00027319C:C03 ES 157
M00027319D:F07 ES 157
M00027332B:H09 ES 157
M00027359B:A06 ES 157
M00027363D:G04 ES 157
M00027364B:E12 ES 157
M00027376C:A02 ES 157
M00027381B:B04 ES 157
M00027400D:H02 ES 157
M00027433B:D12 ES 157
M00027457B:E11 ES 157
M00027459C:B10 ES 157
M00027467A:C07 ES 157
M00027475D:A01 ES 157
M00027480C:E09 ES 157
M00027485C:F07 ES 157
M00027506B:G01 ES 157
M00027513D:F06 ES 157
M00027523A:H05 ES 157
M00027527B:C05 ES 157
M00027549C:G03 ES 157
M00027569A:E05 ES 157
M00027586A:C09 ES 157
M00027589B:G07 ES 157
M00027591A:E04 ES 157
M00027600B:C07 ES 157
M00027605B:D09 ES 157
M00027688C:C01 ES 157
M00027717C:C06 ES 157
M00027724D:D04 ES 157
M00027734D:C03 ES 157
M00027746A:D06 ES 157
M00027801B:D07 ES 157
M00027806C:H05 ES 157
M00028055B:G07 ES 158
M00028063C:H01 ES 158
M00028067A:C11 ES 158
M00028069D:H02 ES 158


A vl AAA^ 0 ATA A . LI AO 


CC 1 CO 

bo Ijo 


M00028070D:C03


ES 158 


M00028188C:H11 


ES 158 


M00028193B:E07 


ES 158 


M00028196A:G03


ES 158 



Clone Name


Tube 


MUDUioz 1 Ub.HUi 


ES 


158 


\/IAAf\O0O1 1 A .CIA 

MUUUzoz J 1 A.r 1U 


ES 


158 


MUOUioz i 2L/.CU5 


ES 


158 


MUUUzoz iyD:HU5 


ES 


158 


MUUUzojoId.HUo 


ES 


158 


MUUUzoJbOD.bUo 


ES 


ICO 

158 


IVIUUUZoD 1 DV^.L/U" 


ES 


158 


TV A A A A 1 0 £. ^ A . (\ H 


ES 


158 


T\4O0O0£7£7 A r,1 1 


ES 


158


MUUUZO /Ohd.UKJj 


ES 


158 


AyfAAAOOTTt A .CA1 

MUUUz5//lA:bUz 


ES 


158 


\4nOOOC777P-PO< 
MUUUzo / / 3U.CU3 


ES 


158 


MUUUzo / /4U.t I U 


ES 


158 


MOOUzo / /7B:u(J4 


ES 


158 


\>tnno7 c7ffo a * pn i 
MUUUzo /ozA.rUI 


ES 


158 


MUUUzo 7o4A:U 1 z 


ES 


158 


A A AAA^> OTOiCt) . A A/1 

MUUUzo7ooB:AU4 


ES 


158 


HyfAAAl 1 TIAD.PAI 

MUUUJ137UB:LU1 


ES 


158 


XyfMAAl 1 yl 1 AC\- LI PkC 
MUUUj 1 4 1 OU. HU J 


ES 


158 


KrfAAAl 1 /lOyi A .CtA^ 

MUUUi 1 484A:UUi 


ES 


158 


\/!AAA1 1 /lOCD-PAC 

MUUU31455B:GU5 


ES 


158 


NyfnnniO/171 r\- ao< 
MUUUJz4 / 1 U. AUO 


ES 


158 


MUUUJZ4 /Jo.AUi 


ES 


158 


MUUUjz4 /4A.OUJ 


ES 


158 


MUUUJZ4 / j A.AUu 


ES 


158 


MUUUJz4oVt>.U 1Z 


ES 


158 


MUUUJ z4yuu . bUo 


ES 


158 


^>^AAA17/10y1^ , ■UAC 

MUUUi Z4V4U.HU5 


ES 


158 


X/fAAA'iO/IO^Ci.D 1 A 

MUUUi z4y /U.blU 


ES 


158 


X/TOAOTO^Oyin'n 1 o 

MUUUiz5U4t>.t> 1 U 


ES 


158 


\A A A Al 7 ^ A7 Vl • ^iO C 

MUUUizDU /fU.UUo 


ES 


158 


TV >f AAAT1C AO A -CA1 

MUUU3z5UoA;bUJ 


ES 


158 


lVAAAAlO^ 1 ^ A -n 1 7 
MUUUizj 1 OA.olz 


ES 


158 


AyfAAA17< 1 1 A 
MUUUjzjI /CbiU 


ES 


158 


TA/fAAATO^ 1 QH-PAfi 

muuujz j i yu.ruo 


ES 


158 


lV/f AAA17C2/1D'rr 1 7 

MUUUJz5J4D.blz 


ES 


158 


MUUU3zj41CUUi 


ES 


158


AAA17CC.7 A • A A7 

MUUU3z5jjA.AU / 


ES 


158 


\400017^^An* A01 
MUUUjZj jOU.AU j 


ES 


158 


TVylAAA'37C.*t7/"^.CA 1 

MUUUJz5ozC.rU 1 


ES 


158 


M00032567B:G05


ES 


158 


M00032568B:F08 


ES 


158 


M00032577D:F01 


ES 


158


M00032580D:A09 


ES 


158


Clone Name


Tube


M00032581B:A09


ES 158


M00032584A:D06 


ES 158 


\A Artrtl 7 C O^/" 1 .!! A/f 

M00032jo6C:d04 


re ICO 

bo Ijo 


M00032590B:H01 


co i co 

ES 158 


MU0032jy4L:r0j 


CC 1 CO 

bo I jo 


M00032597A:H02


ES 158


* jiaaatt^aco ,r\rtn 
M00032oOjo:DOy 


CC 1 CO 

bo Ijo 


M00032613A:E11


ES 158


M00032ol4L:B10 


CC ICO 

bS Ijo 


M00032614D:D08 ES 158
M00032620B:F06 ES 158
M00032621A:r11 ES 158
M00032628C:B06 ES 158
M00032634B:D09 ES 158
M00032637A:F09 ES 158
M00032638B:F02 ES 158
M00032644C:B05 ES 158
M00032645D:C01 ES 158
M00032647B:F06 ES 158
M00032652C:C07 ES 158
M00032666A:C02 ES 158
M00032671B:D06 ES 158
M00032671B:D08 ES 158
M00032676C:C10 ES 158
M00032688C:A03 ES 158


M00032700A:b09 


co t co 

ES Ijo 


M00032707D:F08 ES 158
M00032711B:F01 ES 158
M00032723D:H02 ES 158
M00032727A:E04 ES 158
M00032728D:F01 ES 158
M00032729A:F10 ES 158
M00032733B:T12 ES 158
M00032734B:E12 ES 158
M00032734C:C05 ES 158


X lAAAin/l An./°A1 

M00032749D:O03 


CC t CO 

bS Ijo 


AyfAAA777C5 A ./"'A7 

M00032 /j3A:LU / 


CC 1 CO 

bo Ijo 


X f AAAI^TTA A A AO 

M00032759A:A03 


CC 1 CO 

bS Ijo 


AjIAAATn/CC A ,PAC 


CO 1 CO 

bS Ijo 


M00032770C:G11 


ES 158 


M00032772D:D03 


ES 158 


M00032773D:F08


ES 158


M00032774C:C04 


ES 158 


M00032787D:C05


ES 159 



Clone Name




A4AAA177Q1 Q-lJl 1 

MUUUJ2 fy 1 D.rii J 


CC 1 CO 


MUUUJ2 fy iD.rvl 


CC 1 CO 




CC 1 CO 

co ijy 


\/f AAA'2770'2 A -nn£ 


CC 1 CQ 

bS ijy 


iviuuujz fyj\s./\\jj 


pel CO ■ 
E/o i jy 


MUUUjZ /V /U.UUo 


CC 1 
to IjV 


iviuuu j zozjD.ruo 


CC I CQ : 


IVllFUlFjZoZOV^.L' l U 


ES 159


MUUUjZoZOA. AUO 


ES 159


NyfAAA'JOOOOCt. A AC 

M00032o2VlJ.AU j 


ES 159


MVJUU J Z 5 J UU . UUZ 


ES 159


N/tnnn^oft^ i a -cc\i 

WlxJKJKJjiLo J J A.LU / 


ES 159




ES 159


A-tAAAIOCJ CPk./^A/l 

M00032o35IJ:uu4 


ES 159


muuujzojod.au / 


ES 159


M00032o4oL):b 10 


ES 159


M00032o92L:C 12 


ES 159


XyfAAA'JOAAO A .P\A0 

M00032v0oA:U05 


ES 159


\>f AAA^OA 1 Or\.DAyt 

M0003291oL):d04 


ES 159


a a aaai ini op.nm 
M0003 292 oC : D02 


ES 159


A^AAAOOA/l /I A -DA7 

M00032V44A.dO / 


ES 159


\>f AAATOA/1 CT"\.DA7 

M00032y45D:bU / 


ES 159


MUUUj2V/yU.Cl 1 


ES 159


Ay! A AfH 0 G 70 n • 14 A 7 

muuujzv /yu.iiu / 


ES 159


A/f aa f\7 7Q o c n • r: AO 
Muuujzyoju.ouy 


ES 159


\yfAAA^7Qft7R-PA t 


ES 159


\AAAA170ftfiR*riA 1 
MUUUjZVooD.vjU I 


ES 159


AyfAAAllAA^ A *C 1 A 
MUUUjjUUoA.r 1U 


ES 159


\A AAA^ ^7°.^* A A7 
MUUUj JUZov_/.AUZ 


ES 159


lVvfAAA'i 1 A70C\.r^ 1 A 

MUUU33U2oU.C 1 U 


ES 159


AAAAA11A17P-CAA 
MUUUjjUj /t).rU4 


ES 159


AAAAA1^A/11 A-R1 1 
1VJUUUJ jU4l A.t) 1 1 


ES 159


\AAAA11A^^n-nA7 
1V1 UUU J J U J JU. \J\)L 


ES 159



\yfAAAl^A7 1 P-HA^ 
MUUUJ JU / It.uUj 


ES 159


A Aft 'A A 5 ^ A 7 1 D • C A » 


ES 159


X4AAA33A7? A • A AO 
lvlUUUjJU 1 Lt\.t\\)7 


ES 159


MAAAl^ ARAP- A A7 


ES 159


MAAAl^A81D*ni 1 

IVlUvUJ JuO 1 U.LJ 1 1 


ES 159


MAAA'H 1 AAA-HA? 


ES 159


\4AAA17 1 A £1"V A (\1 

MUUU33 14olJ.AU3 


ES 159


M00033147C:B08 


ES 159


M00033149B:E10 


ES 159


M00033150B:E02 


ES 159


M00033150C:A11


ES 159 
Clone Name


Tube 


M00033183B:F10 


ES 159 


M00033218C:F07 


ES 159 


M00033223C:G04


ES 159


M00033230L:Cj1U 


ES 159


* A AAA"7 OTJ^D./^AQ 

Muuu33z3zd:Cu«> 


ES 159


M00033246A:H12


ES 159


MUuuJJz4oU.Hl 1 


ES 159


M00033264B:E06 


ES 159


xa'aaa'} in/i n.cm 
M000332/4U:rl)j 


ES 159


M00033311B:G10


ES 159


M00033324B:rU4 


ES 159


M00033326B:B05 


ES 159


M00033329C:C02


ES 159


M00033296C:C11


ES 159


M00033302A:E11


ES 159


M00033302B:F10 


ES 159


M00033303C:F09 


ES 159 


M00033342B:F03 


ES 159 


M00033344A:B06 


ES 159 


M00033359C:H05 


ES 159 


M00033360C:A03


ES 159


M00033374D:C07


ES 159


M00033413A:A08


ES 159


M00033420B:E08 


ES 159


M00033434D:r05 


ES 159


M00033441A:B12


ES 159


M00033445D:G03 


ES 159


M00038290A:D12


ES 159


X 4 A A A "> O "> A /I O ,C A1 

M00U383O4B:buz 


ES 159


M00038389D:D10 


ES 159


\ jf AAA"? OTfiAD.UAl 

M0U03o39ur3:ruz 


ES 159


M00038616C:C09 


ES 159


A jfAAAl QjC 1 /CF\. D A^7 


ES 159


XjfAAATO/TI or\.nAO 

M00Q3861oD:Du5 


ES 159


XylAAAl Q£^ QD • COG 


ES 159


M00038619D.O2 


ES 159 


M00039001A:B10


ES 159


M00039024D:E12


ES 159 


M00039055C:A01


ES 159 


M00039056B:G01


ES 159 


M00039063C:H09 


ES 159


M00039067A:C05


ES 159 


M00039067B:F07 


ES 159


M00039076D:G04


ES 159 



Clone Name 


Tube 


Mn00^9fi78RR01 

IVIUuUJyu / OU.JJvJ 


ES 159




ES 159


M006l9081B C04 


"cc 

C-O 


ISO 


M00019081 B-n07 


ES 159


MOOOIQ 1 00 A *G04 


FS 


ISO 


M0 00391 6 5 D - A 6 8 " 


FS 